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Preface 


These are notes for the lecture course “Differential Geometry I” given by the 
second author at ETH Ziirich in the fall semester 2017. They are based on 
a lecture course! given by the first author at the University of Wisconsin— 
Madison in the fall semester 1983. 

One can distinguish extrinsic differential geometry and intrinsic differ- 
ential geometry. The former restricts attention to submanifolds of Euclidean 
space while the latter studies manifolds equipped with a Riemannian metric. 
The extrinsic theory is more accessible because we can visualize curves and 
surfaces in R*, but some topics can best be handled with the intrinsic theory. 
The definitions in Chapter 2 have been worded in such a way that it is easy 
to read them either extrinsically or intrinsically and the subsequent chapters 
are mostly (but not entirely) extrinsic. One can teach a self contained one 
semester course in extrinsic differential geometry by starting with Chapter 2 
and skipping the sections marked with an asterisk like §2.8. = 

Here is a description of the content of the book, chapter by chapter. 
Chapter 1 gives a brief historical introduction to differential geometry and 
explains the extrinsic versus the intrinsic viewpoint of the subject.” This 
chapter was not included in the lecture course at ETH. 

The mathematical treatment of the field begins in earnest in Chapter 2, 
which introduces the foundational concepts used in differential geometry 
and topology. It begins by defining manifolds in the extrinsic setting as 
smooth submanifolds of Euclidean space, and then moves on to tangent 
spaces, submanifolds and embeddings, and vector fields and flows. The 
chapter includes an introduction to Lie groups in the extrinsic setting and a 
proof of the Closed Subgroup Theorem. It then discusses vector bundles and 
submersions and proves the Theorem of Frobenius. The last two sections 
deal with the intrinsic setting and can be skipped at first reading. 


‘Extrinsic Differential Geometry 


*Tf is shown in §1.3 how any topological atlas on a set induces a topology. 


3Our sign convention for the Lie bracket of vector fields is explained in §2.5.7. 


ili 


Chapter 3 introduces the Levi-Civita connection as covariant derivatives 
of vector fields along curves. (The covariant derivative of a global vector 
field is deferred to §5.2.2.) It continues with parallel transport, introduces 
motions without sliding, twisting, and wobbling, and proves the Develop- 
ment Theorem. It also characterizes the Levi-Civita connection in terms of 
the Christoffel symbols. The last section introduces Riemannian metrics in 
the intrinsic setting, establishes their existence, and characterizes the Levi- 
Civita connection as the unique torsion-free Riemannian connection on the 
tangent bundle. 

Chapter 4 defines geodesics as critical points of the energy functional and 
introduces the distance function defined in terms of the lengths of curves. It 
then examines the exponential map and the injectivity radius, establishes the 
local existence of minimal geodesics, and proves the existence of geodesically 
convex neighborhoods. A highlight of this chapter is the proof of the Hopf— 
Rinow Theorem and of the equivalence of geodesic and metric completeness. 
The last section shows how these concepts and results carry over to the 
intrinsic setting. 

Chapter 5 introduces isometries and the Riemann curvature tensor and 
proves the Generalized Theorema Egregium, which asserts that isometries 
preserve geodesics, the covariant derivative, and the curvature. 

Chapter 6 contains some answers to what can be viewed as the funda- 
mental problem of differential geometry: When are two manifolds isometric? 
The central tool for answering this question is the Cartan—Ambrose—Hicks 
Theorem, which etablishes necessary and sufficient conditions for the exis- 
tence of a (local) isometry between two Riemannian manifolds. The chapter 
then moves on to examine flat spaces, symmetric spaces, and constant sec- 
tional curvature manifolds. It also includes a discussion of manifolds with 
nonpositive sectional curvature, proofs of the Cartan—Hadamard Theorem 
and of Cartan’s Fixed Point Theorem, and as the main example a discussion 
of the space of positive definite symmetric matrices equipped with a natural 
Riemannian metric of nonpositive sectional curvature. 

This is the point at which the ETH lecture course ended. However, 
Chapter 6 contains some additional material such as a proof of the Bonnet— 
Myers Theorem about manifolds with positive Ricci curvature, and it ends 
with brief discussions of the scalar curvature and the Weyl tensor. 

The logical progression of the book up to this point is linear in that 
every chapter builds on the material of the previous one, and so no chapter 
can be skipped except for the first. What can be skipped at first reading 
are only the sections labelled with an asterisk that carry over the various 
notions introduced in the extrinsic setting to the intrinsic setting. 


Chapter 7 deals with various specific topics that are at the heart of the 
subject but go beyond the scope of a one semester lecture course. It begins 
with a section on conjugate points and the Morse Index Theorem, which 
follows on naturally from Chapter 4 about geodesics. These results in turn 
are used in the proof of continuity of the injectivity radius in the second 
section. The third section builds on Chapter 5 on isometries and the Rie- 
mann curvature tensor. It contains a proof of the Myers—Steenrod Theorem, 
which asserts that the group of isometries is always a finite-dimensional Lie 
group. The fourth section examines the special case of the isometry group of 
a compact Lie group equipped with a bi-invariant Riemannian metric. The 
last two sections are devoted to Donaldson’s differential geometric approach 
to Lie algebra theory as explained in [17]. They build on all the previous 
chapters and especially on the material in Chapter 6 about manifolds with 
nonpositive sectional curvature. The fifth section establishes conditions un- 
der which a convex function on a Hadamard manifold has a critical point. 
The last section uses these results to show that the Killing form on a simple 
Lie algebra is nondegenerate, to establish uniqueness up to conjugation of 
maximal compact subgroups of the automorphism group of a semisimple Lie 
algebra, and to prove Cartan’s theorem about the compact real form of a 
semisimple complex Lie algebra. 

The appendix contains brief discussions of some fundamental notions of 
analysis such as maps and functions, normal forms, and Euclidean spaces, 
that play a central role throughout this book. 

This document is designed to be read either as a .pdf file or as a printed 
book. 

We thank everyone who pointed out errors or typos in earlier versions 
of this book. In particular, we thank Charel Antony and Samuel Trautwein 
for many helpful comments. 


27 June 2021 Joel W. Robbin and Dietmar A. Salamon 
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Chapter 1 


What is Differential 
Geometry? 


1.1 Cartography and Differential Geometry 


Carl Friedrich Gauf (1777-1855) is the father of differential geometry. He 
was (among many other things) a cartographer and many terms in modern 
differential geometry (chart, atlas, map, coordinate system, geodesic, etc.) 
reflect these origins. He was led to his Theorema Egregium (see 5.3.1) by 
the question of whether it is possible to draw an accurate map of a portion 
of our planet. Let us begin by discussing a mathematical formulation of this 
problem. 

Consider the two-dimensional sphere S? sitting in the three-dimensional 
Euclidean space R?. It is cut out by the equation 

ee y? +27 =1. 

A map of a small region U C 5? is represented mathematically by a one-to- 
one correspondence with a small region in the plane z = 0. In this book we 
will represent this with the notation ¢: U > ¢(U) C R? and call such an 
object a chart or a system of local coordinates (see Fugure 1.1). 

What does it mean that ¢ is an “accurate” map? Ideally the user would 
want to use the map to compute the length of a curve in S?. The length of 
a curve y connecting two points p,q € S? is given by the formula 


1 
L(y) = i: WW()| dt, (0) =p, 71) =a, 


so the user will want the chart ¢ to satisfy L(y) = L(¢07) for all curves ¥. 
It is a consequence of the Theorema Egregium that there is no such chart. 
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—< 


Na, 
Figure 1.1: A chart 


Perhaps the user of such a map will be content to use the map to plot 
the shortest path between two points p and q in U. This path is called a 
geodesic and is denoted by pq. It satisfies L(Ypq) = du (p,q), where 


du (p,q) = inf{ L(y) | y(t) € U, 10) =p, yA) = 4h 


so our less demanding user will be content if the chart ¢ satisfies 


du (p,q) = dzu($(p), (4) 


where dz(¢(p), ¢(qg)) is the length of the shortest path in the plane. It is 
also a consequence of the Theorema Egregium that there is no such chart. 
Now suppose our user is content to have a map which makes it easy to 
navigate close to the shortest path connecting two points. Ideally the user 
would use a straight edge, magnetic compass, and protractor to do this. 
S/he would draw a straight line on the map connecting p and q and steer a 
course which maintains a constant angle (on the map) between the course 
and meridians. This can be done by the method of stereographic projection. 
This chart is conformal (which means that it preserves angles). According 
to Wikipedia stereographic projection was known to the ancient Greeks 
and a map using stereographic projection was constructed in the early 16th 
century. Exercises 3.7.5, 3.7.12, and 6.4.22 use stereographic projection; the 
latter exercise deals with the Poincaré model of the hyperbolic plane. The 
hyperbolic plane provides a counterexample to Euclid’s Parallel Postulate. 


Exercise 1.1.1. It is more or less obvious that for any surface M C R? 
there is a unique shortest path in M connecting two points if they are 
sufficiently close. (This will be proved in Theorem 4.5.3.) This shortest 
path is called the minimal geodesic connecting p and g. Use this fact to 
prove that the minimal geodesic joining two points p and q in S? is an arc 
of the great circle through p and q. (This is the intersection of the sphere 
with the plane through p, q, and the center of the sphere.) Also prove that 
the minimal geodesic connecting two points in a plane is the straight line 
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Figure 1.2: Stereographic Projection 


segment connecting them. Hint: Both a great circle in a sphere and a line 
in a plane are preserved by a reflection. (See also Exercise 4.2.5 below.) 
Exercise 1.1.2. Stereographic projection is defined by the condition that 
for p € S*\ n the point $(p) lies in the xy-plane z = 0 and the three 
points n = (0,0,1), p, and ¢(p) are collinear (see Figure 1.2). Using the 
formula that the cosine of the angle between two unit vectors is their inner 
product prove that ¢ is conformal. Hint: The plane through p, q, and n 
intersects the xy-plane in a straight line and the sphere in a circle through 
n. The plane through n, p, ¢(p), and the center of the sphere intersects the 
sphere in a meridian. A proof that stereographic projection is conformal 
can be found in [27, page 248]. The proof is elementary in the sense that 
it doesn’t use calculus. An elementary proof can also be found online at 
http://people.reed.edu/jerry/311/stereo. pdf. 


Exercise 1.1.3. It may seem fairly obvious that you can’t draw an accurate 
map of a portion of the earth because the sphere is curved. However the 
cylinder C = {(z,y,z) € R® | 27+ y? = 1} is also curved, but the map 
w : R? > C defined by ~(s, t) = (cos(t), sin(t), s) preserves lengths of curves, 
ie. L(w oy) = L(y) for any curve 7 : [a,b] + R?. Prove this. 


1.1.4. Standard Notations. The standard notations N, No, Z, Q, R, 
C denote respectively the natural numbers (= positive integers), the non- 
negative integers, the integers, the rational numbers, the real numbers, and 
the complex numbers. We denote the identity map of a set X by idx and 
then xn identity matrix by 1, or simply 1. The notation V* is used for the 
dual of a vector space V, but when K is a field like R or C the notation K* 
is sometimes used for the multiplicative group K \ {0}. The terms smooth, 
infinitely differentiable, and C™ are all synonymous. 
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1.2. Coordinates 


The rest of this chapter defines the category of smooth manifolds and smooth 
maps between them. Before giving the precise definitions we will introduce 
some terminology and give some examples. 


Definition 1.2.1. A chart on a set M is a pair (¢,U) where U is a subset 
of M and ¢: U > (VU) is a bijection from U to an open set o(U) in R™. 
An atlas on M is a collection 


A= {(¢a; Ua) }aeA 


of charts such that the domains Uq cover M, i.e. 


M= [. | ee. 


acA 


The idea is that if ¢(p) = (x1(p),...,2%m(p)) for p € U, then the func- 
tions x; form a system of local coordinates defined on the subset U of M. 
The dimension of M should be m since it takes m numbers to uniquely spec- 
ify a point of U. We will soon impose conditions on charts (¢, U), however 
for the moment we are assuming nothing about the maps ¢ (other than that 
they are bijective). 


Example 1.2.2. Every open subset U C R™ has an atlas consisting of a 
single chart, namely (¢,U) = (idy,U) where idy denotes the identity map 
of U. 


Example 1.2.3. Assume that 2 Cc R™ is an open set, that M is a subset 
of the product R™ x R” = R™*", and that 


h:Q—-> R” 
is a continuous map whose graph is M, i.e. 
graph(h) := {(2,y) €Qx R”|y=h(az)} = M. 


Let U = graph(h) = M and let ¢(z,y) = x be the projection of U onto 2. 
Then the pair (¢,U) is a chart on M. The inverse map is given by 


g*(a) = (x, h(a) 


for ¢ € Q = $(U). Thus M has again an atlas consisting of a single chart. 
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Example 1.2.4. The m-sphere 


has an atlas consisting of the 2m+2 charts ¢;4 : Uji — D™ where D™ is the 
open unit disk in R™, U;iz = {p € S™| +2; > 0}, and + is the projection 
which discards the ith coordinate. (See Example 2.1.14 below.) 


Example 1.2.5. Let 
A= Al ec R(mt+))x(mt+1) 
be a symmetric matrix and define a quadratic form F : R™+! > R by 
F@y=p' Ap, p= (oyx-52m) 
After a linear change of coordinates the function F’ has the form 


F(p) = 28+ taf aby — a 


(Here r + 1 is the rank of the matrix A.) The set M = F~1(1) has an atlas 
of 2m + 2 charts by the same construction as in Example 1.2.4, in fact S’” 
is the special case where A = I,41, the (m+ 1) x (m+ 1) identity matrix. 
(See Example 2.1.13 below for another way to construct charts.) 


Figure 1.3 enumerates the familiar quadric surfaces in 3. The para- 
boloids are examples of graphs as in Example 1.2.3 with Q = R? and n = 1, 
and the ellipsoid and the two hyperboloids are instances of the quadric hyper- 
surfaces defined in Example 1.2.5. The sphere is an instance of the ellipsoid 
(with a = b =c=1) and the cylinder is a limit of the ellipsoid as well as of 
the elliptic hyperboloid of one sheet (as a = b = 1 and c > oo). The pictures 
were generated by computer using the parameterizations 


x = acos(t) cos(s), y = bsin(t) cos(s), z = csin(s) 
for the ellipsoid, 
x = acos(t) cosh(s), y = bsin(t) cosh(s), z =csinh(s), 


for the elliptic hyperboloid of one sheet, and 


x = acos(t) sinh(s), y = bsin(t) sinh(s), z = +ecosh(s) 


for the elliptic hyperboloid of two sheets. These quadric surfaces will be 
often used in the sequel to illustrate important concepts. 
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Unit Sphere getyt227=1 


Ellipsoid ee a 


Cylinder ety =1 == 
Elliptic | ae 
Hyperboloid a aor 1 
(of one sheet) 

Elliptic | ge 
Hyperboloid 2 + oa 1 
(of two sheets) 

Hyperbolic P . 
Paraboloid z= > — z 
Elliptic ae 
Paraboloid Z= a + RB 


Figure 1.3: Quadric Surfaces 
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In the following two examples K denotes either the field R of real numbers 
or the field C of complex numbers, K* := {A € K|A #4 0} denotes the 
corresponding multiplicative group, and V denotes a vector space over K. 


Example 1.2.6. The projective space of V is the set of lines (through 
the origin) in V. In other words, 


P(V) = {€C V |Z is a 1-dimensional K-linear subspace} 


When K=R and V = R™*! this is denoted by RP” and when K=C 
and V = C+! this is denoted by CP™. For our purposes we can identify the 
spaces C+! and R?”+? but the projective spaces CP™ and RP?” are very 
different. The various lines  € P(V) intersect in the origin, however, after 
the identification P(V) = {[v] | v € V \ {0}} with [v] :-= K*v = Kv \ {0} the 
elements of P(V) become disjoint, i.e. P(V) is the set of equivalence classes 
of an equivalence relation on the open set V \ {0}. Assume that V = K™+! 
and define an atlas on P(V) as follows. For each integer 7 = 0,1,...,m 
define U; := {[v] |v = (xo0,.-.,%m), 2; # O} and define ¢; : U; > K™ by 


¢;([v]) = (2 gee eee 3) 


d 


This atlas consists of m+ 1 charts. 
Example 1.2.7. For each positive integer k the set 
G(V) := {Cc V |Z is a k-dimensional K-linear subspace} 


is called the Grassmann manifold of k-planes in V. Thus Gi(V) = P(V). 
Assume that V = K” and define an atlas on G;,(V) as follows. Let €1,...,€n 
be the standard basis for K”, i.e. e; is the ith column of the n x n identity 
matrix: 1,.. Each partition {1,2,...,n) = TUd, Ff = 44 -< se-< ah 
J = {ji < +++ < jn_x} of the first n natural numbers determines a direct 
sum decomposition K” = V = V; 6 V; via the formulas 


V7 = Ke;, + --- + Kej,, Vi = Rey ob Rez, - 5: 


Let Uy denote the set of all k-planes € € Gz(V) which are transverse to V7, 
i.e. such that 2M Vz = {0}. The elements of Uy; are precisely those k-planes 
of the form ¢= graph(A), where A: V; + Vz is a linear map. Define the 
map ¢, : Ur; > K**("—*) by the formula 


n—-k 
O10 Says); Ae Slaseyy PH Ijtsk: 
s=1 


Exercise: Prove that the set of all pairs (67, Ur) as I ranges over the subsets 
of {1,...,} of cardinality k form an atlas. 
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1.3. Topological Manifolds* 


Definition 1.3.1. A topological manifold is a topological space M such 
that each point p € M has an open neighborhood U which is homeomorphic 
to an open subset of a Euclidean space. 


Brouwer’s Invariance of Domain Theorem asserts that, when U Cc R™ 
and V C R” are nonempty open sets and @: U > V is a homeomorphism, 
then m=n. This means that if M is a connected topological manifold 
and some point of M has a neighborhood homeomorphic to an open subset 
of R™, then every point of M has a neighborhood homeomorphic to an open 
subset of that same R”. In this case we say that M has dimension m or 
is m-dimensional or is an m-manifold. Brouwer’s theorem is fairly difficult 
(see [24, p. 126] for example) but if ¢ is a diffeomorphism, then the result is 
an easy consequence of the invariance of the rank in linear algebra and the 
chain rule. (See equation (1.4.1) below.) 

By definition, a topological m-manifold M admits an atlas where every 
chart (¢,U) of the atlas is a homeomorphism ¢ : U + ¢(U) from an open 
set U Cc M to an open set 6(U) C R™. The following definition and lemma 
explain when a given atlas determines a topology on M. 


Definition 1.3.2. Let M be a set. Two charts (¢1,U1) and (¢2,U2) on M 
are said to be topologically compatible iff ¢1(U1 M U2) and ¢2(U1 N U2) 
are open subsets of R™ and the transition map 


gar = $20 O] + b1(U1 N U2) + $2(U1N U2) 
is a homeomorphism. An atlas is said to be a topological atlas iff any two 
charts in this atlas are topologically compatible. 
Lemma 1.3.3. Let Y = {(ba, Ua) aca be an atlas on a set M. 
(i) If & is a topological atlas, then the collection 


da(U MN U,) is an open subset of R™ } 


for everyac A (1.3.1) 


win {UcM 


is a topology on M, and with this topology each Ug is an open subset of M 
and each dg is a homeomorphism. Thus M is a topological manifold with 
the topology (1.3.1). 


(ii) If M is a topological manifold and each Us, is an open set and each da 
is a homeomorphism, then & is a topological atlas and the topology (1.3.1) 
coincides with the topology of M. 


Proof. Exercise. 
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If M is a topological manifold, then the collection of all charts (U, ¢) 
on M such that U is open and ¢ is a homeomorphism is a topological atlas. 
It is the unique maximal topological atlas in the sense that it contains 
every other topological atlas as in part (ii) of Lemma 1.3.3. However, we will 
often consider smaller, even finite, topological atlases, and by Lemma 1.3.3 
each of these determines the topology of M. 


Exercise 1.3.4. Show that the atlas in each example in 81.2 is a topological 
atlas. Conclude that each of these examples is a topological manifold. 


Any subset S C X of a topological space X inherits a topology from X, 
called the relative topology of S. A subset Up C S is called relatively 
open in S (or S-open) iff there is an open set U C X such that Up =UNS. 
A subset Ag C S' is called relatively closed (or S-closed) iff there is a 
closed set A C X such that Ag = AMS. The relative topology on S is the 
finest topology such that the inclusion map S — X is continuous. 


Exercise 1.3.5. Show that the relative topology satisfies the axioms of a 
topology (i.e. arbitrary unions and finite intersections of S-open sets are S- 
open, and the empty set and S itself are S-open). Show that the complement 
of an S-open set in S is S-closed and vice versa. 


Exercise 1.3.6. Each of the sets defined in Examples 1.2.2, 1.2.3, 1.2.4, 
and 1.2.5 is a subset of some Euclidean space R*. Show that the topology 
in Exercise 1.3.4 is the relative topology inherited from the topology of R*. 
The topology on R* is of course the metric topology defined by the distance 
function d(x, y) = |x — y|. 


If ~ is an equivalence relation on a topological space X, then the quo- 
tient space Y := X/~ := {[z]|x © X} is the set of all equivalence classes 
[z] :-= {x’ € X|a2' ~ x}. The map a7: X > Y defined by a(x) = [a] will 
be called the obvious projection. The quotient space inherits the quo- 
tient topology from Y. Namely, a set V C Y is open in this topology 
iff the preimage 7 ~'(V) is open in X. This topology is the finest topol- 
ogy on Y such that projection 7: X — Y is continuous. Since the oper- 
ation V+ 2 ~!(V) commutes with arbitrary unions and intersections the 
quotient topology obviously satisfies the axioms of a topology. 


Exercise 1.3.7. Show that the topology on the projective space P(V) in 
Example 1.2.6 determined by the atlas is the quotient topology inherited 
from the open set V \ {0}. Express the Grassmann manifold G;(V) in 
Example 1.2.7 as a quotient space and show that the topology determined 
by the atlas is the quotient topology. (Recall that in both examples V = K” 
with K=R or K=C.) 
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1.4 Smooth Manifolds Defined* 


Let U C R” and V C R™ be open sets. A map f : U > V is called smooth 
iff it is infinitely differentiable, i.e. iff all its partial derivatives 


OUT Fon f 


a An? 
da... Axo" 


i a = (a1,..-,Qn) € NG, 
exist and are continuous. In later chapters we will sometimes write C™(U, V) 
for the set of smooth maps from U to V. 


Definition 1.4.1. Let U C R” and V C R” be open sets. For a smooth 
map f =(fi,---,fm):U > V anda point x € U the derivative of f at x 
is the linear map df(x) :R" > R™ defined by 


fle+ #8) = tim £2 +8) -F@) 


df(w)é : im ; ,  €eR" 


© dt lh 


This linear map is represented by the Jacobian matrix of f at x which 
will also be denoted by 


oft Of, 
5a _ bE 
af(s)3= : : €R™*", 
Ofm 
See) + Gla (a) 


Note that we use the same notation for the Jacobian matrix and the corre- 
sponding linear map from R” to R™. 


The derivative satisfies the chain rule. Namely, if U C R", V Cc R”, 
and W Cc R¢ are open sets and f :U > V and g: V > W are smooth map, 
then go f : U > W is smooth and 


d(go f)(x) = dg(f(x)) odf(x) :R” > Rf (1.4.1) 


for every x € U. Moreover the identity map idy : U > U is always smooth 
and its derivative at every point is the identity map of R”. This implies that, 
if f : U — V isa diffeomorphism (i.e. f is bijective and f and f~! are both 
smooth), then its derivative at every point is an invertible linear map. This 
is why the Invariance of Domain Theorem (discussed after Definition 1.3.1) 
is easy for diffeomorphisms: if f : U > V is a diffeomorphism, then the 
Jacobian matrix df(x) € R™*” is invertible for every x € U and som=n. 
The Inverse Function Theorem (see Theorem A.2.2 in Appendix A.2) is a 
kind of converse. — 
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Definition 1.4.2 (Smooth manifold). Let M be a set. A chart on M 
is a pair ($,U) where U C M and ¢ is a bijection from U to an open sub- 
set o(U) C R™ of some Euclidean space. Two charts (¢,,U;) and (¢2, U2) 
are said to be smoothly compatible iff ¢;(U; M U2) and ¢2(U; MN U2) are 
both open in R™ and the transition map 


21 = $20 od; : oi(U1 a U2) -> o2(U4 M U2) (1.4.2) 


is a diffeomorphism. A smooth atlas on M is a collection & of charts 
on M any two of which are smoothly compatible and such that the sets U, 
as (¢,U) ranges over the elements of &, cover M (i.e. for every p € M 
there is a chart (¢,U) € & with p€ U). A maximal smooth atlas is an 
atlas which contains every chart which is smoothly compatible with each of 
its members. A smooth manifold is a pair consisting of a set M and a 
maximal smooth atlas & on M. 


Lemma 1.4.3. If & is a smooth atlas, then so is the collection & of all 
charts smoothly compatible with each member of &. The smooth atlas & is 
obviously maximal. In other words, every smooth atlas extends uniquely to 
a maximal smooth atlas. 


Proof. Let (¢1,U1) and (¢2,U2) be charts in & and let x € ¢1(U1 M U2). 
Choose a chart (¢,U) € & such that ¢7'(x) € U. Then $1(U NU, 9 U2) is 
an open neighborhood of « in R™ and the transition maps 


60 ¢,': d1(U NU, NU2) = ((U NU, NU2), 
d20¢ +: ¢(UNULNU2) > do(U NUN U2) 


are smooth by definition of </. Hence so is their composition. This shows 
that the map o20," : 61(U,NU2) > ¢2(U1NU2) is smooth near x. Since x 
was chosen arbitrary, this map is smooth. Apply the same argument to its 
inverse to deduce that it is a diffeomorphism. Thus & is a smooth atlas. 


Definitions 1.4.2 and 1.3.2 are mutatis mutandis the same, so every 
smooth atlas on a set M is a fortiori a topological atlas, i.e. every smooth 
manifold is a topological manifold. (See Lemma 1.3.3.) Moreover the defi- 
nitions are worded in such a way that it is obvious that every smooth map 
is continuous. 


Exercise 1.4.4. Show that each of the atlases from the examples in §1.2 is a 
smooth atlas. (You must show that the transition maps from Exercise 1.3.4 
are smooth.) 
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When . is a smooth atlas on a topological manifold M one says that / 
is a smooth structure on the (topological) manifold M iff # C &, 
where @ is the maximal topological atlas on M. When no confusion can 
result we generally drop the notation for the maximal smooth atlas as in the 
following exercise. 


Exercise 1.4.5. Define the notion of a continuous map between topological 
manifolds and of a smooth map between smooth manifolds via continuity, 
respectively smoothness, in local coordinates. Let M, N, and P be smooth 
manifolds and f : M— N and g: N > P be smooth maps. Prove that 
the identity map idyy is smooth and that the composition go f:M— P 
is a smooth map. (This is of course an easy consequence of the chain 
rule (1.4.1).) 


Remark 1.4.6. It is easy to see that a topological manifold can have 
many distinct smooth structures. For example, {(idp,R)} and {(¢,R)} 
where ¢(x) = 2° are atlases on the real numbers which extend to distinct 
smooth structures but determine the same topology. However these two 
manifolds are diffeomorphic via the map x +> 2!/3. In the 1950’s it was 
proved that there are smooth manifolds which are homeomorphic but not dif- 
feomorphic and that there are topological manifolds which admit no smooth 
structure. In the 1980’s it was proved in dimension m = 4 that there are 
uncountably many smooth manifolds that are all homeomorphic to R* but 
no two of them are diffeomorphic to each other. These theorems are very 
surprising and very deep. 


A collection of sets and maps between them is called a category iff the 
collection of maps contains the identity map of every set in the collection and 
the composition of any two maps in the collection is also in the collection. 
The sets are called the objects of the category and the maps are called the 
morphisms of the category. An invertible morphism whose inverse is also in 
the category is called an isomorphism. Some examples are the category of 
all sets and maps, the category of topological spaces and continuous maps 
(the isomorphisms are the homeomorphisms), the category of topological 
manifolds and continuous maps between them, and the category of smooth 
manifolds and smooth maps (the isomorphisms are the diffeomorphisms). 
Each of the last three categories is a subcategory of the preceding one. 

Often categories are enlarged by a kind of “gluing process”. For example, 
the “global” category of smooth manifolds and smooth maps was constructed 
from the “local” category of open sets in Euclidean space and smooth maps 
between them via the device of charts and atlases. (The chain rule shows 
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that this local category is in fact a category.) The point of Definition 1.3.2 
is to show (via Lemma 1.3.3) that topological manifolds can be defined 
in a manner analogous to the definition we gave for smooth manifolds in 
Definition 1.4.2. 

Other kinds of manifolds (and hence other kinds of geometry) are de- 
fined by choosing other local categories, i.e. by imposing conditions on the 
transition maps in Equation (1.4.2). For example, a real analytic manifold 
is one where the transition maps are real analytic, a complex manifold is one 
whose coordinate charts take values in C” and whose transition maps are 
holomorphic diffeomorphisms, and a symplectic manifold is one whose coor- 
dinate charts take values in R2” and whose transition maps are canonical 
transformations in the sense of classical mechanics. Thus CP” is a complex 
manifold and RP” is a real analytic manifold. 


1.5 The Master Plan 


In studying differential geometry it is best to begin with extrinsic differential 
geometry which is the study of the geometry of submanifolds of Euclidean 
space as in Examples 1.2.3, and 1.2.5. This is because we can visualize 
curves and surfaces in R°. However, there are a few topics in the later 
chapters which require the more abstract Definition 1.4.2 even to say inter- 
esting things about extrinsic geometry. There is a generalization to these 
manifolds involving a structure called a Riemannian metric. We will call 
this generalization intrinsic differential geometry. Examples 1.2.6 and 1.2.7 
fit into this more general definition so intrinsic differential geometry can be 
used to study them. 


Since an open set in Euclidean space is a smooth manifold the definition 
of a submanifold of Euclidean space (see §2.1 below) is mutatis mutandis 
the same as the definition of a submanifold of a manifold. The definitions 
in Chapter 2 are worded in such a way that it is easy to read them either 
extrinsically or intrinsically and the subsequent chapters are mostly (but not 
entirely) extrinsic. Those sections which require intrinsic differential geom- 
etry (or which translate extrinsic concepts into intrinsic ones) are marked 
with a *. 
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CHAPTER 1. 


WHAT IS DIFFERENTIAL GEOMETRY? 


Chapter 2 


Foundations 


This chapter introduces various fundamental concepts that are central to 
the fields of differential geometry and differential topology. Both fields con- 
cern the study of smooth manifolds and their diffeomorphisms. The chapter 
begins with an introduction to submanifolds of Euclidean space and smooth 
maps (§2.1), to tangent spaces and derivatives (§2.2), and to submanifolds 
and embeddings (§2.3). In §2.4 we move on to vector fields and flows and 
introduce the Lie bracket of two vector fields. Lie groups and their Lie alge- 
bras, in the extrinsic setting, are the subject of 82.5, which includes a proof 
of the Closed Subgroup Theorem. In 82.6 we introduce vector bundles over 
a manifold as subbundles of a trivial bundle and in 82.7 we prove the theo- 
rem of Frobenius. The last two sections of this chapter are concerned with 
carrying over all these concepts from the extrinsic to the intrinsic setting 
and can be skipped at first reading (82.8 and §2.9). 


2.1 Submanifolds of Euclidean Space 


To carry out the Master Plan §1.5 we must (as was done in [50]) extend 
the definition of smooth map to ‘Maps f :X > Y between subsets X Cc R* 
and Y C R¢ which are not necessarily open. In this case a map f : X > Y is 
called smooth iff for each x9 € X there exists an open neighborhood U c R* 
of zo and a smooth map F': U > R* that agrees with f on UM X. A 
map f : X > Y is called a diffeomorphism iff f is bijective and f and f~! 
are smooth. When there exists a diffeomorphism f : X — Y then X and Y 
are called diffeomorphic. When X and Y are open these definitions coin- 


cide with the usage in 81.4. 
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Exercise 2.1.1 (Chain Rule). Let X Cc R*, Y C R‘, Z C R™ be arbitrary 
subsets. If f : X — Y and g: Y — Z are smooth maps, then so is the 
composition go f : X + Z. The identity map id: X + X is smooth. 


Exercise 2.1.2. Let E Cc R*® be an m-dimensional linear subspace and 
let v1,...,Um be a basis of FE. Then the map f : R™ — E defined by 
f(x) = Oy2, it; is a diffeomorphism. 


Definition 2.1.3. Let k,m € No. A subset M Cc R* is called a smooth 
m-dimensional submanifold of R* iff every point p © M has an open 
neighborhood U C R® such that UN M is diffeomorphic to an open sub- 
setQ CR”. A diffeomorphism 


o:UNMAQ 
is called a coordinate chart of M and its inverse 
w:=¢'1:23UNM 


is called a (smooth) parametrization of UN M. 


UnM M fn) Q 
( Cay _— > 


v 


Figure 2.1: A coordinate chart 6: UNM >. 


In Definition 2.1.3 we have used the fact that the domain of a smooth 
map can be an arbitrary subset of Euclidean space and need not be open 
(see page 15). The term m-manifold in R* is short for m-dimensional sub- 
manifold of R®. In keeping with the Master Plan 81.5 we will sometimes say 
manifold rather than submanifold of R® to indicate that the context holds 


in both the intrinsic and extrinsic settings. 


Lemma 2.1.4. If MC R* is a nonempty smooth m-manifold, then m < k. 


Proof. Fix an element po € M, choose a coordinate chart 6: UNM > Q 
with po € U and values in an open subset 2 Cc R™, and denote its in- 
verse by := 67':Q—-UNM. Shrinking U, if necessary, we may as- 
sume that ¢ extends to a smooth map ®: U — R™. This extension satis- 
fies ®(¢)(x)) = o(W(x)) =a and hence d®(q(x))dy(x) = id: R™ > R™ for 
all 2 € Q, by the chain rule. Hence the derivative dy)(x) : R™ — R* is in- 
jective for all x € 2, and hence m < k because 2 is nonempty. This proves 
Lemma, 2.1.4. 
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Example 2.1.5. Consider the 2-sphere 
M:=S? ={(2,y,z)ER' |e’ +y4+22=1} 
depicted in Figure 2.2 and let U C R? and Q C R? be the open sets 
U := {(2,y,z) € R®|z>0}, Q:= {(2,y) € R’|a?+y? <1}. 
The map ¢: UNM = Q given by 
b(x, y, 2) = (x,y) 
is bijective and its inverse w := d-1:Q 4 UN M is given by 
(zy) = (2, y, V1 — 2? —y?). 


Since both ¢ and w are smooth, the map ¢ is a coordinate chart on S?. 
Similarly, we can use the open sets z < 0, y > 0, y < 0,2 > 0, x < 0 to cover 
S? by six coordinate charts. Hence S? is a manifold. A similar argument 
shows that the unit sphere S$” C R™+! (see Example 2.1.14 below) is a 
manifold for every integer m > 0. 


=. 


Figure 2.2: The 2-sphere and the 2-torus. 


Example 2.1.6. Let 2 C R™ be an open set and h: Q > R*-™ be asmooth 
map. Then the graph of h is a smooth submanifold of R™ x R*-™ = R*: 


M = graph(h) := {(z,y)|z EQ, y=A(z)}. 


It can be covered by a single coordinate chart 6 : UMM — V where 
U :=0x R*-™, ¢ is the projection onto 2, and a) := 1:2 > U is given 
by ¥(2) = (2, h(x)) for ¢ € 2. 


Exercise 2.1.7 (The case m = 0). Show that a subset M C R* is a 0- 
dimensional submanifold if and only if M is discrete, i.e. for every p © M 
there exists an open set U C R* such that UM M = {pt}. 
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Exercise 2.1.8 (The case m = k). Show that a subset M C R™ is an 
m-dimensional submanifold if and only if M is open. 


Exercise 2.1.9 (Products). If M; C R* is an m;-manifold for i = 1,2, 
show that M, x Mp is an (m; + m2)-dimensional submanifold of R®+'2. 
Prove that the m-torus T™ := (S')™ is a smooth submanifold of C™. 


The next theorem characterizes smooth submanifolds of Euclidean space. 
In particular condition (iii) will be useful in many cases for verifying the 
manifold condition. We emphasize that the sets Up := UM M that appear 
in Definition 2.1.3 are open subsets of M with respect to the relative topology 
that M inherits from the ambient space R* and that such relatively open 
sets are also called M-open (see page 9). 


Theorem 2.1.10 (Manifolds). Let m and k be integers withO<m<k. 
Let M C R¥ be a set and pp € M. Then the following are equivalent. 


(i) There exists an M-open neighborhood Up C M of po and a diffeomor- 
phism og : Up 4 Qo onto an open set Qo C R™. 


(ii) There exist open sets U,Q Cc R* and a diffeomorphism @ : U + Q such 
that pp € U and 
e(UNM) =2N(R™ x {0}) 


(see Figure 2.3). 
(iii) There exists an open set U C R® and a smooth map f : U — R*-™ 
such that po € U, the derivative df(p) : R*’ + R*-™ is surjective for every 
point pe UNM, and 

UNM = f*(0) ={p € U| fm) = 0}. 


Moreover, if (i) holds, then the diffeomorphism ¢:U > Q in (ii) can be 
chosen such that UM M C Up and ¢(p) = (¢o0(p),0) for everypEe UNM. 


U QD 


CNB) —— 


Figure 2.3: Submanifolds of Euclidean space. 


M 


2.1. SUBMANIFOLDS OF EUCLIDEAN SPACE 19 


Proof. First assume (ii) and denote the diffeomorphism in (ii) by 


b = (1, ¢2,.-., 4) :U ACR. 
Then part (i) holds with Up := UN M, Oo := {a € R™ | (x, 0) € O}, and 


do = (d1,.-- Om), : Uo = Qo, 


and part (iii) holds with f := (dm41,...,¢r) : U + R*-™. This shows that 
part (ii) implies both (i) and (iii). 

We prove that (i) implies (ii). Let ¢9 : Up Qo be the coordinate chart 
in part (i), let Wo := ce : Qo > Up be its inverse, and let xo := ¢0(po) € Qo. 
Then the derivative dyo(xo) : R™ — R* is injective by Lemma 2.1.4. Hence 
there exists a matrix B € R**(*—™) such that det({dwzo(xo) B]) 4 0. Define 
the map # : Q x R*-™ > R* by 

v(x, y) = bo(x) + By. 
Then the k x k-matrix dy(xo,0) = [dwo(x0) B] € R*** is nonsingular, by 
choice of B. Hence, by the Inverse Function Theorem A.2.2, there exists an 


open neighborhood Q C No x R*-™ of (29,0) such that U := )(Q) C RF is 
open and wg: Q—-> Uisa diffeomorphism. In particular, the restriction of w 


to Q is injective. Now the set {x € Q| (x,0) € Q} is open and contains 2p. 
Hence the set 


Up = { do(e) |x € Qo, (#,0) € a} = {p E Us| (¢0(p), 0) € ah CM 
is M-open and contains po. Hence, by the definition of the relative topology, 
there exists an open set W C R* such that Up) = WOM. Define 
U:=UNW, Q:=Qnyl(W). 
Then UN M = Up and w restricts to a diffeomorphism from 2 to U. 
Now let (2, y) € Q. We claim that 
v(z,y)€ M a y=0. (21.1) 


If y = 0, then obviously w(x, y) = Wo(x) € M. Conversely, let (x,y) € Q and 
suppose that p:= p(z,y) € M. Thenpe UNM =UNWOM= Us c Up 
and hence (¢o(p),0) € 2, by definition of Up. This implies 


w(¢0(p), 9) = vo(go(p)) = p = V(x, y). 


Since the pairs (x,y) and (¢9(p),0) both belong to the set Q and the re- 
striction of = to Q is injective we obtain z = ¢o(p) and y = 0. This 
proves (2.1.1). It follows from (2.1.1) that the map ¢:= (Wj): U = Q 
satisfies (UNM) = {(2,y) € Q] d(z,y) € M} =QN(R™ x {0}). Thus we 
have proved that (i) implies (ii). 
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We prove that (iii) implies (ii). Let f : U + R*-™ be as in part (iii). 
Then po € U and the derivative df (po) : R* + R*-™ is a surjective linear 
map. Hence there exists a matrix A € R™** such that 


sot( yy) 0 


Define the map ¢: U > R* by 


o(p) := ( ) epee. 


Then det(d¢(po)) 4 0. Hence, by the Inverse Function Theorem A.2.2, there 
exists an open neighborhood U’ c U of po such that 9’ := $(U’) is an open 
subset of R* and the restriction ¢! := ¢|y : U’ > 0! is a diffeomorphism. 
In particular, the restriction ¢|y is injective. Moreover, it follows from the 
assumptions on f and the definition of ¢ that 


U'NM = {peU'| f(p) =0} = {pe U"| 6p) ER™ x {OF} 


and so ¢!(U'NM) = 0'N(R™ x {0}). Hence the diffeomorphism ¢’ : U’ > 
satisfies the requirements of part (ii). This proves Theorem 2.1.10. 


The next corollary relates the notion of a smooth map on a smooth 
submanifold as defined on page 15 to the standard notion of smoothness in 
local coordinates used in the intrinsic setting of §2.8 below. 


Corollary 2.1.11. Let M Cc R* be a smooth m-dimensional submanifold 
and let f : M— R* be a map. Then the following are equivalent. 


(i) f is smooth (as defined on page 15). 


(ii) If Uo C M is an M-open set and ¢o : Up + Qo is a diffeomorhism onto 
an open set Q9 C R™, then the composition f o 1. — R¢ is smooth. 


Proof. Assume (ii), let po € M, and choose ¢ = (41,...,¢%) : U 3 A.C R* 
as in part (ii) of Theorem 2.1.10. Shrinking U, if necessary, we may assume 
that Q =p x Q1, where Q C R™ is an open set and Q; C R*-™ is an 
open neighborhood of the origin. Then the map N) > R’: 2 fo @7!(z,0) 
is smooth by part (ii). Define F(p) := fo ¢7!(¢1(p),.--,@m(p),0,...,0) 
for p € U. Then the map F : U + R° is smooth and agrees with f on UNM. 
Thus f satisfies (i). That (i) implies (ii) follows directly from the definitions 
and this proves Corollary 2.1.11. 
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Definition 2.1.12 (Regular value). Let U Cc R* be an open set and 
let f :U > R* be a smooth map. An element c € R° is called a regular 
value of f iff, for allp © U, we have 


fM=He = df(p) : R® > R° is surjective. 


Otherwise c is called a singular value of f. Theorem 2.1.10 asserts that, 
if c is a regular value of f, then the preimage 


M := f-'(c)={peU| f(p) =c} 


is a smooth (k — )-dimensional submanifold of R*. 


Examples and Exercises 


Example 2.1.13. Let A = A' € R*** be a symmetric matrix and de- 
fine the function f : R*’ > R by f(x) := v' Ax. Then df(x)€ = 2x' AE 
for a,€ € R* and hence the linear map df(x) : R* > R is surjective if and 
only if Ax £0. Thus c = 0 is the only singular value of f and hence, for 
every element c € R \ {0}, the set M:= f-l(c) ={xz € R*|x2' Ar =c} isa 
smooth manifold of dimension m = k — 1. 


Example 2.1.14 (The sphere). As a special case of Example 2.1.13 con- 
sider the case k =m-+1, A = 1, and c=1. Then f(x) = |2|* and so we 
have another proof that the unit sphere 


s™ = {zx ER™| |g? = i} 


in R™*! is a smooth m-manifold. (See Examples 1.2.4 and 2.1.5.) 


Example 2.1.15. Define the map f : R® x R? > R by f(x,y) := |x —y’. 
This is another special case of Example 2.1.13 and so, for every r > 0, the 
set M := {(x,y) € R® x R?| |x — y| =r} is a smooth 5-manifold. 


Example 2.1.16 (The 2-torus). Let 0 <r <1 and define f : R®? > R by 


f(x,y, 2) = (a? + y? +r? — 2 — 1)? — 4(2? + y*)(r? — 2”). 


This map has zero as a regular value and M := f~1(0) is diffeomorphic to 
the 2-torus T? = S$! x S'. An explicit diffeomorphism is given by 

(els elf) G ((1 + r-cos(s)) cos(t), (1 + r cos(s)) sin(t), 7 sin(s)). 
This example corresponds to the second surface in Figure 2.2. 


Exercise: Show that f(z, y, z) = 0 if and only if (,/2? + y?2 — 1)? 4+ 22 =r. 
Verify that zero is a regular value of f. 
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Example 2.1.17. The set 
M := Le? a? z Ye; zu, ry) | LLY, ZE R, = + y? + e = 1} 
is a smooth 2-manifold in R®. To see this, define an equivalence relation on 


the unit sphere S? C R® by p~ q iff g = +p. The quotient space (the set of 
equivalence classes) is called the real projective plane and is denoted by 


RP? := §?/{+1}. 


(See Example 1.2.6.) It is equipped with the quotient topology, i.e. a sub- 
set U C RP? is open, by definition, iff its preimage under the obvious projec- 
tion S? > RP? is an open subset of S?. Now the map f : 5? > R® defined 
by 


ie, Y; Zz) = (ee 0s aX, ry) 


descends to a homeomorphism from RP? onto M. The submanifold M is 
covered by the local smooth parameterizations 


= Ms (,y) > fe, /l= 2— 7), 
QM : (2,2) f(x, V1—2? — 27,2), 


Q>M :(y,z) fi Lay? = 27,912) 


defined on the open unit disc Q C R?. We remark the following. 

(a) M is not the preimage of a regular value under a smooth map R® —> R*. 
(b) M is not diffeomorphic to a submanifold of R°. 
(c) The projection © := { (yz, z@, cy) |c,y,zER, et+yt+2= 1} of M 
onto the last three coordinates is called the Roman surface and was dis- 
covered by Jakob Steiner. The Roman surface can also be represented as 


the set of solutions (€,7,¢) € R® of the equation 72¢? + C7€? + €?n? = Enc. 
It is not a submanifold of R°. 


Exercise: Prove this. Show that M is diffeomorphic to a submanifold of R*. 
Show that M is diffeomorphic to RP? as defined in Example 1.2.6. 


Exercise 2.1.18. Let V : R" — R be a smooth function and define the 
Hamiltonian function H : R” x R” > R (kinetic plus potential energy) by 


H(2,y) = 5 |v? +V (2). 


Prove that c is a regular value of H if and only if it is a regular value of V. 
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Exercise 2.1.19. Consider the general linear group 
GL(n,R) = {g € R"“" | det(g) 4 0} 
Prove that the derivative of the function f = det : R"*” — R is given by 
df (g)v = det(g) trace(g~!v) 
for all g € GL(n, R) and v € R"*”. Deduce that the special linear group 
SL(n,R) := {g € GL(n, R) | det(g) = 1} 
is a smooth submanifold of R”*”. 


Example 2.1.20. The orthogonal group 

O(n) := {9 ER" |g'g = a} 
is a smooth submanifold of R”*”. To see this, denote by 

Pys= {s ER x" | ST — s} 
the vector space of symmetric matrices and define f : R"*” > .Y, by 

f(g) =9"9. 
Its derivative df(g) : R"*" > A, is given by 
df(g)v =glu+u'g. 


This map is surjective for every g € O(n): if g'g = land S=S' €.%, 
then the matrix v := 5 gS satisfies 
ie | eee ee ome ee 
df(gju= 59 98 + 5(98) g= 5545 
Hence 1 is a regular value of f and so O(n) is a smooth manifold. It has 
the dimension 
n(n+1)  n(n—1) 


dim O(n) = n? — dim.%, = n? 5 = 5 


Exercise 2.1.21. Prove that the set 
M := {(z,y) € R’|cy =0} 


is not a submanifold of R?. Hint: If U C R? is a neighborhood of the 
origin and f : U — R is a smooth map such that UM M = f~!(0), then 
df (0,0) =0. 
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Figure 2.4: The tangent space T,M and the translated tangent space p+7T,M. 


2.2 Tangent Spaces and Derivatives 


The main reason for first discussing the extrinsic notion of embedded mani- 
folds in Euclidean space as explained in the Master Plan 81.5 is that the 
concept of a tangent vector is much easier to digest in the embedded case: 
it is simply the derivative of a curve in M, understood as a vector in the 
ambient Euclidean space in which M is embedded. 


2.2.1 Tangent Space 


Definition 2.2.1 (Tangent vector). Let M C R* be a smooth m-dimen- 
sional manifold and fix a point p € M. A vector v € R* is called a tangent 
vector of M at p iff there exists a smooth curve y: R— M such that 


y(0)=p, —-¥(0) =v. 
The set 
T,)M := {7(0)|y: R- M is smooth, y(0) = p} 
of tangent vectors of M at p is called the tangent space of M at p. 


Theorem 2.2.3 below shows that 7, is a linear subspace of R*. As does 
any linear subspace it contains the origin; it need not actually intersect M. 
Its translate p+7,M touches M at p; this is what you should visualize 
for T,M (see Figure 2.4). 


Remark 2.2.2. Let p € M be as in Definition 2.2.1 and let v € R®. Then 


€>0 dy: (-e,¢) ~ M such that 
yy is smooth, 7(0) = p, 7(0) = v. 


ve T,M => { 


To see this suppose that 7 : (—e,¢) + M is a smooth curve with y(0) = p 
and 7(0) = v. Define y: R- M by 


=(t) =() , teR 


Then ¥ is smooth and satisfies ¥(0) = p and ¥(0) =v. Hence v € TpM. 


2.2. TANGENT SPACES AND DERIVATIVES 25 


Theorem 2.2.3 (Tangent spaces). Let M Cc R* be a smooth m-dimen- 
sional manifold and fix a point p€ M. Then the following holds. 


(i) Let Up C M be an M-open set with p € Up and let do: Up 4 Oo 
be a diffeomorphism onto an open subset N97 CR™. Let xo := do(p) and 
let Wo := bo - :Q 9 — Uo be the inverse map. Then 


T,M = im (dub0(0) -R™ R‘) 
(ii) Let U,Q Cc R* be open sets and ¢: U + Q be a diffeomorphism such 
that p€ U and 6(UNM) =2N (R™ x {O}). Then 
T,M = dd(p)* (R™ x {0}). 


(iii) Let U C R*® be an open neighborhood of p and f : U + R*-™ be a 
smooth map such that 0 is a regular value of f and UNM = f-+(0). Then 


T,M = ker df(p). 


(iv) T,M is an m-dimensional linear subspace of R*. 


Proof. Let Wo : Qo 4 Up and xp € No be as in (i) and let 6: U > Q be as 
in (ii). We prove that 


im difo(z0) C TpM Cc dé(p)~* (R™ x {0}). (22.41) 
To prove the first inclusion in (2.2.1), choose a constant r > 0 such that 
B,(xo) := {x € R™| |x — zo] < r} Co. 
Now let € € R’™ and choose ¢ > 0 so small that 
ele <r. 
Then xo + t€ € Qo for all t € R with |t| < ¢. Define 7: (—e,e) — M by 
y(t) := wo(xo + té) for —e<t<e. 


Then y is a smooth curve in M satisfying 


(0) = po(to) = p, ¥(0) = “ - Wo(xo + t€) = dyo(xo)&. 


Hence it follows from Remark 2.2.2 that dyo(xo)€ € T,)M, as claimed. 
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To prove the second inclusion in (2.2.1) we fix a vector v € T,M. Then, 
by definition of the tangent space, there exists a smooth curve y : R > M 
such that 7(0) = p and 7(0) = v. Let U C R* be as in (ii) and choose € > 0 


so small that y(t) € U for |t] < «. Then 
o(y(t)) € (UNM) CR™ x {0} 
for |t] < ¢ and hence 


d 


do(p)v = de(y(0))7(0) = dil. 


o(y(t)) € R™ x {0}. 
This shows that v € d¢(p)~' (R™ x {0}) and thus we have proved (2.2.1). 
Now the sets im dao(xo) and d¢(p)~! (R™ x {0}) are both m-dimensional 
linear subspaces of R*. Hence it follows from (2.2.1) that these subspaces 
agree and that they both agree with 7,M. Thus we have proved asser- 
tions (i), (ii), and (iv). 
We prove (iii). Let v € T,M. Then there is a smooth curve 7: R- M 
such that 7(0) = p and ¥(0) = v. For t sufficiently small we have y(t) € U, 
where U C R* is the open set in (iii), and f(y(t)) = 0. Hence 


afte = at) = Fl _ F(0) =. 


This implies T,.M C ker df(p). Since T,M and the kernel of df(p) are both 
m-dimensional linear subspaces of R*, we deduce that TpM = ker df(p). 
This proves part (iii) and Theorem 2.2.3. 


Exercise 2.2.4. Let M Cc R* be a smooth m-dimensional manifold and 
let pj € M be a sequence that converges to a point pE M. Let 7 be a 
sequence of nonzero real numbers and let v € R* such that 


lim 7; = 0, lim fT v. 
i—oo 70) Tj 


Prove that v € T,M. Hint: Use part (iii) of Theorem 2.2.3. 


Example 2.2.5. Let A = A' € R*** be a nonzero matrix as in Exam- 
ple 2.1.13 and let c # 0. Then part (iii) of Theorem 2.2.3 asserts that the 
tangent space of the manifold 


M = {xeR* |" Ax =ch 
at a point x € M is the (k — 1)-dimensional linear subspace 


T,M = {€ €R*|27 Ag =o}. 
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Example 2.2.6. As a special case of Example 2.2.5 with A = ll andc=1 
we find that the tangent space of the unit sphere S$” C R™*! at a point 
x € S™ is the orthogonal complement of x: 


Te =e ={Fer™ | a0 =0!. 


Here (x,€) = S>j".9 vi€ denotes the standard inner product on R™*?. 


Exercise 2.2.7. What is the tangent space of the 5-manifold 
M := {(z,y) € R® x R*| |x -—y| =r} 
at a point (x,y) € M? (See Exercise 2.1.15.) 


Example 2.2.8. Let H(2z,y) := 5 \y|° + V(x) be as in Exercise 2.1.18 and 
let c be a regular value of H. If (x,y) € M := H~'(c), then 


Tioy)M = {(6,7) € R” x R"| (y,n) + (VV (x), €) = OF. 
Here VV := (OV/021,...,0V/Orn) : R" + R” denotes the gradient of V. 


Exercise 2.2.9. The tangent space of SL(n, R) at the identity matrix is the 
space 


sl(n,R) := TySL(n, R) = {€ € R™™” | trace(€) = 0} 
of traceless matrices. (Prove this, using Exercise 2.1.19.) 


Example 2.2.10. The tangent space of O(n) at g is 
TgO(n) = {v ER™"| glutaltg= 0} : 


In particular, the tangent space of O(n) at the identity matrix is the space 
of skew-symmetric matrices 


o(n) := T,O(n) = {é ERM eT 4 e = o} 


To see this, choose a smooth curve R + O(n) : t+ g(t). Then g(t)' g(t) = 1 
for all t€ R and, differentiating this identity with respect to t, we ob- 
tain g(t)" g(t) + g(t)" g(t) =0 for every t. Hence every matrix v € T,O(n) 
satisfies the equation g'v+u'g =0. With this understood, the claim fol- 
lows from the fact that g'v+vu'g =0 if and only if the matrix €:= g~!v 
is skew-symmetric and that the space of skew-symmetric matrices in R"*” 


has dimension n(n — 1)/2. 


Exercise 2.2.11. Let Q C R™ be an open set and h : Q > R*-™ be a 
smooth map. Prove that the tangent space of the graph of h at a point 
(x, h(x)) is the graph of the derivative dh(x) : R™ > R*-™: 


M={(a#,h(x))|e EQ}, — Taney M = {(6,dh(@)g)|€ € R™}. 
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Exercise 2.2.12 (Monge coordinates). Let M be a smooth m-manifold 
in R* and suppose that p € M is such that the projection T, pM — R™ x {0} 
is invertible. Prove that there exists an open set Q C R™ and a smooth 
map h:Q— R*-™ such that the graph of h is an M-open neighborhood 
of p (see Example 2.1.6). Of course, the projection T,M — R™ x {0} need 
not be invertible, but it must be invertible for at least one of the (*) choices 
of the m-dimensional coordinate plane. Hence every point of M has an M- 
open neighborhood which may be expressed as a graph of a function of some 


of the coordinates in terms of the others as in e.g. Example 2.1.5. 


2.2.2. Derivative 


A key purpose behind the concept of a smooth manifold is to carry over 
the notion of a smooth map and its derivatives from the realm of first year 
analysis to the present geometric setting. Here is the basic definition. It ap- 
peals to the notion of a smooth map between arbitrary subsets of Euclidean 
spaces as introduced on page 15. 


Definition 2.2.13 (Derivative). Let M Cc R* be an m-dimensional smooth 


manifold and let 
f:Mo>R 


be a smooth map. The derivative of f at a point p € M is the map 
df(p) : Ty» M > R® 
defined as follows. Given a tangent vector v € T,M, choose a smooth curve 
y: RAM 


satisfying 
V0)=p, 40) =», 
and define the vector df(p)v € R® by 


(2.2.2) 


That the limit on the right in equation (2.2.2) exists follows from our 
assumptions. We must prove, however, that the derivative is well defined, 
i.e. that the right hand side of (2.2.2) depends only on the tangent vector v 
and not on the choice of the curve y used in the definition. This is the 
content of the first assertion in the next theorem. 
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Theorem 2.2.14 (Derivatives). Let M C R* be an m-dimensional smooth 
manifold and f : M — R* be a smooth map. Fix a point p € M. Then the 


following holds. 
(i) The right hand side of (2.2.2) is independent of y. 


(ii) The map df(p) : TpM > R° is linear. 
(iii) If N C R° is a smooth n-manifold and f(M) CN, then 


df (p)TpM e Ts(p)N. 


(iv) (Chain Rule) Let N be as in (iti), suppose that f(M) C N, and 
let g: N — R® be a smooth map. Then 


d(go f)(p) = dg(f(p)) df (p) : Ty»M > R*. 
(v) If f =id: M > M, then df(p) =id:T,M > T,M. 


Proof. We prove (i). Let v € T,M and y:R—- M beas in Definition 2.2.13. 
By definition there is an open neighborhood U C R* of p and a smooth 
map F': U — R° such that 


F(p’) = f(p’) for all p) EUNM. 


Let dF(p) € R* denote the Jacobian matrix (i.e. the matrix of all first 
partial derivatives) of F at p. Then, since y(t) ¢ UMM for t sufficiently 
small, we have 


dF(p)u = dF (y(0))4(0) 


d 
FPO) 


d 
Ff): 


The right hand side of this identity is independent of the choice of F’ while 
the left hand side is independent of the choice of y. Hence the right hand 
side is also independent of the choice of 7 and this proves (i). Assertion (ii) 
follows immediately from the identity 


just established. 
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Assertion (iii) follows directly from the definitions. Namely, if y is as in 
Definition 2.2.13, then 6 := foy:R-— N is a smooth curve in N satisfying 


BO) =fOO)=fe)=aq 60) =df(p)jv= wu. 
Hence w € T,N. Assertion (iv) also follows directly from the definitions. 
If g: N > R¢ is a smooth map and £,q, w are as above, then 


= 5 atfon 


d 
= a os g(B(t)) 
= dg(q)w 
= dg(f(p))df(p)v. 


This proves (iv). Assertion (v) follows again directly from the definitions 
and this proves Theorem 2.2.14. 


d(go f)(p)v 


Corollary 2.2.15 (Diffeomorphisms). Let M Cc R* be a smooth m-mani- 
fold and N C R® be a smooth n-manifold and let f : M > N be a diffeomor- 
phism. Then m=n and the derivative df(p):T,)M — Typ)N is a vector 
space isomorphism with inverse 


df (p)~* = df-*(f(p)) : Ty@)N > TpM 
for allpe M. 


Proof. Define g := f~!: N > M so that go f = idy and f og = idy. 
Then it follows from Theorem 2.2.14 that, for p € M and q := f(p) € N, we 
have dg(q) odf(p) = id: T,M — T,M and df(p) odg(q) = id: T,N — T,N. 
Hence df(p) : T,M — T,N is a vector space isomorphism and its inverss is 
given by dg(q) = df(p)~! : TyN — T,M. Hence m =n and this proves 
Corollary 2.2.15. 


Exercise 2.2.16. Let M Cc R* be asmooth manifold and let f : M > R* be 
a smooth map. Let p; € M be a sequence that converges to a point pe M, 
let 7; be a sequence of nonzero real numbers, and let v € T,,M such that 


Pi— Pp 


lim 47, = 0, lim ——— = v. 
100 400 Ti 
(See Exercise 2.2.4.) Prove that 
1 0o Th 


Hint: Use the local extension F’ of f in the proof of Theorem 2.2.14. 
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2.2.3. The Inverse Function Theorem 


Corollary 2.2.15 is analogous to the corresponding assertion for smooth maps 
between open subsets of Euclidean space. Likewise, the inverse function 
theorem for manifolds is a partial converse of Corollary 2.2.15. 


Figure 2.5: The Inverse Function Theorem. 


Theorem 2.2.17 (Inverse Function Theorem). Assume that M Cc R* 
and N C R* are smooth m-manifolds and f :M— N is a smooth map. 
Let po € M and suppose that the derivative 


df (po) 2 Typ M = Lei 


is a vector space isomorphism. Then there exists an M-open neighbor- 
hood U C M of po such that V := f(U) C N is an N-open subset of N and 
the restriction f|y :U — V is a diffeomorphism. 


Proof. Choose coordinate charts ¢9 : Uo > oi defined on an M-open neigh- 
borhood Up C M of po onto an open set Up C R™, and Wo : Vo > Vo, defined 
on an N-open neighborhood Vo Cc N of f(po) onto an open set Vo CR™. 
Shrinking Uo, if necessary, we may assume that f(Uo) C Vo. Then the map 


f = Wefods': U7 Vo 


(see Figure 2.5) is smooth and its derivative df (29) : R™ 3 R™ is bijective 
at 20 := = $0(p0). Hence the Inverse Function Theorem A.2.2 asserts that 
there exists an open neighborhood Uc Uo of zp such that V := f(U U) is an 
open subset of Vo and the restriction of f to [3 is a ciieomioepal a from U 
to V. Hence the assertion holds with U := ¢)'(U U) and V := wo (V). This 
proves Theorem 2.2.17. 
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Regular Values 


Definition 2.2.18 (Regular value). Let M C R* be a smooth m-manifold, 
let N C R® be a smooth n-manifold, and let f : M —» N be a smooth map. 
An element q€ N is called a regular value of f iff, for every pe M 
with f(p) =q, the derivative df(p) : TpM — Typ)N is surjective. 


Theorem 2.2.19 (Regular values). Let f : M — N be as in Defini- 
tion 2.2.18 and let q € N be a regular value of f. Then the set 


P:= f"(q) ={pE M| f(p) = 4} 


is a smooth submanifold of R® of dimension m — n and, for each point p € P, 
its tangent space at p is given by 


TP = ker df(p) = {v € T,M | df(p)v = 0}. 


Proof 1. Fix a point po € P and choose a linear map A: R* + R™-"” such 
that the restriction of A to the (m — n)-dimensional linear subspace 


ker df (po) = {v © Too M | df (po) = 0} C Ty,M Cc R* 
is a vector space isomorphism. Define the map Ff’: M > N x R™"” by 


F(p) := (f(p), Ap) 


for p€ M. The derivative of F' at po is given by dF(po)v = (df(po)v, Av) 
for v € TM and is a vector space isomorphism. Hence, by Theorem 2.2.17 
there exists an M-open neighborhood U Cc M of po such that V := F(U) 
is an open subset of N x R™” in the relative topology and the restric- 
tion F|y :U + V is a diffeomorphism. Hence the P-open set UNM P is 
diffeomorphic to the open set 0 := {y € R”™ "| (q,y) © V} Cc R™™” by the 
diffeomorphism ¢: UM P + Q, defined by ¢(p) := Ap for p € UN P, whose 
inverse is the smooth map w:Q—> UMP given by o(y) = (Flu) 1(¢,y) 
for y € Q. This shows that P is a smooth (m—n)-manifold in R*. 

Now let p € Pandv € T,P. Then there exists a smooth curve y : R + P 
such that y(0) = p and 4(0) = v. Since f(y(t)) = 0 for all t, we have 


d 


df(p)v = dl, 


f(y(t)) =0 
and so v € ker df(p). Hence T,P C kerdf(p) and equality holds because 
both J,P and kerdf(p) are (m — n)-dimensional linear subspaces of R*. 
This proves Theorem 2.2.19. 


2.2. TANGENT SPACES AND DERIVATIVES 33 


Proof 2. Here is another proof of Theorem 2.2.19 in local coordinates. Fix 
a point po € P and choose a coordinate chart $9 : Uo > ¢0(Uo) C R™ on 
an M-open neighborhood Up C M of po. Likewise, choose a coordinate 
chart wo : Vo > wo(VYo) C R” on an N-open neighborhood Vo Cc N of gq. 
Shrinking Up, if necessary, we may assume that f(Uo9) C Vo. Then the 
point co := wWo(q) is a regular value of the map 


fo := 0° f 045 : d0(Uo) > R". 


Namely, if x € ¢9(Up) satisfies fo(x) = co, then p := $5 (x) € Up NP, so the 
maps d¢,"(x):R™ 4 T,M, df(p):TpM 4 T,N, and dyo(q): TzN > R” 
are all surjective, hence so is their composition, and by the chain rule this 
composition is the derivative dfo(x) : R’ — R”. With this understood, it 
follows from Theorem 2.1.10 that the set 


fo ‘(co) = {x € bo(Uo) | f(49'(z)) = a} = b0(Uo NP) 


is a manifold of of dimension m—n contained in the open set ¢9(Uo) C R™. 
Using Definition 2.1.3 and shrinking Up further, if necessary, we may assume 
that the set @9(Up N P) is diffeomorphic to an open subset of R’’~”". Com- 
posing this diffeomorphism with ¢9 we find that Up M P is diffeomorphic to 
the same open subset of R™~”. Since the set Up C M is M-open, there 
exists an open set U C R* such that UM M = Up, hence UN P= Up NP, 
and so UgM P is a P-open neighborhood of po. Thus we have proved 
that every element po € P has a P-open neigborhood that is diffeomor- 
phic to an open subset of R’-". Thus P Cc R* is a manifold of dimen- 
sion m —n (Definition 2.1.3). The proof that the tangent spaces of P are 
given by T,P = ker df(p) remains unchanged and this completes the second 
proof of Theorem 2.2.19. 


Definition 2.2.20. Let MC R* and N CR* be smooth m-manifolds. A 
smooth map f :M— N is called a local diffeomorphism iff its deriva- 
tive df(p):T,M — Typ) N is a vector space isomorphism for every p € M. 


Example 2.2.21. The inclusion of an M-open subset U C M into M and 
the map R > S!:t+5 e” are examples of local diffeomorphisms. 


Exercise 2.2.22. Prove that the image of a local diffeomorphism is an open 
subset of the target manifold. Hint: Use the Inverse Function Theorem. 


In the terminology introduced in §2.3 and §2.6.1 below, local diffeomor- 
phisms are both immersions and submersions. In particular, if f : M— N 
is a local diffeomorphism, then every element gq € N is a regular value of f 
and its preimage f~'(q) is a discrete subset of M. 
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2.3. Submanifolds and Embeddings 


This section deals with subsets of a manifold M that are themselves mani- 
folds as in Definition 2.1.3. Such subsets are called submanifolds of M. 


Definition 2.3.1 (Submanifold). Let M Cc R* be an m-dimensional man- 
ifold. A subset L C M is called a submanifold of M of dimension ¢, iff L 
itself is an £-manifold. 


Definition 2.3.2 (Embedding). Let M Cc R* be an m-dimensional mani- 
fold and N C R° be an n-dimensional manifold. A smooth map f : N > M 
is called an immersion iff its derivatue df(q) :TyN > Ty(q)M is injective 
for every q € N. It is called proper if, for every compact subset kK C f(N), 
the preimage f—'(K) ={q€ N| f(q) € K} is compact. The map f is called 
an embedding iff it is a proper injective immersion. 


Remark 2.3.3. In our definition of proper maps it is important that the 
compact set K is required to be contained in the image of f. The literature 
also contains a stronger definition of proper which requires that f—'(K) is 
a compact subset of N for every compact subset kK C M, whether or not K 
is contained in the image of f. This holds if and only if the map f is proper 
in the sense of Definition 2.3.2 and has an M-closed image. (Exercise!) 


o(U) 


Figure 2.6: A coordinate chart adapted to a submanifold. 


Theorem 2.3.4 (Submanifolds). Let M Cc R* be an m-dimensional man- 
ifold and N C R° be an n-dimensional manifold. 

(i) If f: N > M is an embedding, then f(N) is a submanifold of M. 

(ii) If P C M is a submanifold, then the inclusion P + M is an embedding. 
(iii) A subset P C M is a submanifold of dimension n if and only if, for 
every po € P, there exists a coordinate chart 6: U > R™ on an M-open 
neighborhood U of po such that (UN P) = ¢(U) N(R” x {0}) (Figure 2.6). 
(iv) A subset P C M is a submanifold of dimension n if and only if, for ev- 
ery po € P, there exists an M-open neighborhood U C M of po and a smooth 
map g:U—+ R™” such that 0 is a regular value of g and UN P =g7(0). 


Proof. See page 36. 
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Lemma 2.3.5 (Embeddings). Let M and N be as in Theorem 2.3.4, 
let f: N > M be an embedding, let qg € N, and define 


P:=f(N), po:= f(g) € P. 


Then there exists an M-open neighborhood U C M of po, an N-open neigh- 
borhood V C N of qo, an open neighborhood W C R™—"™ of the origin, and 
a diffeomorphism F:V x W - U such that, for allq € V and all z € W, 


F(q,0) = f(q), (2.3.1) 
F(q,z)€P = 2=0. (2:3:2) 


Proof. Choose any coordinate chart ¢9 : Up > R™ on an M-open neighbor- 
hood Up C M of po- Then d(do fe) f)(q) = ddo(f (qo)) fo) df (qo) : To )N — R™ 
is injective. Hence there is a linear map B : R™—” > R” such that the map 


Ty) N x R™" + R™ : (w, 6) +4 d(do 0 f)(qo)w + BC (2.3.3) 
is a vector space isomorphism. Define the set 


Q:= {(42z) €NxR™™| f(g € Uo, bo(f(a)) + Bz € Go(Uo)} . 
This is an open subset of N x R’—” and we define F':Q — M by 


F(q,2) := $9‘ (¢0(f(q)) + Bz). 


This map is smooth, it satisfies F(q,0) = f(q) for all q € f~!(Uo), and 
the derivative dF(qo,0) : Tg,N x R™-" — T,,M is the composition of the 
map (2.3.3) with d¢o(po)~! : R™ > TM and so is a vector space isomor- 
phism. Thus the Inverse Function Theorem 2.2.17 asserts that there is an 
N-open neighborhood Vo C N of go and an open neighborhood Wp Cc R™—” 
of the origin such that Vo x Wo C Q, the set Up := F(Vo x Wo) is M-open, 
and the restriction of F' to Vo x Wo is a diffeomorphism onto Up. Thus we 
have constructed a diffeomorphism F’: Vo x Wo — Up that satisfies (2.3.1). 

We claim that the restriction of F' to the product V x W of sufficiently 
small open neighborhoods V C N of qo and W Cc R”™” of the origin also 
satisfies (2.3.2). Otherwise, there exist sequences g; € Vo converging to qo 
and z; € Wo \ {0} converging to zero such that F'(q;, z;) € P. Hence there 
exists a sequence g; € N such that F'(q, z:) = f(q;). This sequence converges 
to f(qo). Since f is proper we may assume, passing to a suitable subsequence 
if necessary, that g; converges to a point qj € N. Then 


F(qo) = Jim f(g) = Jim FG, zi) = f(g0).- 
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Since f is injective, this implies qj = qo. Hence (qj,0) € Vo x Wo for 7 
sufficiently large and F(q},0) = f(q}) = F(a, x). This contradicts the fact 
that the map Ff’: Vo x Wo — M is injective, and proves Lemma 2.3.5. 


Proof of Theorem 2.3.4. We prove (i). Let qo € N, denote po := f(qo) € P, 
and choose a diffeomorphism F': V x W — U as in Lemma 2.3.5. Then the 
set V CN is diffeomorphic to an open subset of R” (after schrinking V 
if necessary), the set UM P is P-open because U C M is M-open, and we 
have UN P= {F(q,0)|¢e€V} =f(V) by (2.3.1) and (2.3.2). Hence the 
map f:V UMP is a diffeomorphism whose inverse is the composition 
of the smooth maps F-!:UNP—+VxW and VxW 3V:(q,2z) 4. 
Hence a P-open neighborhood of pg is diffeomorphic to an open subset of R”. 
Since po € P was chosen arbitrary, this shows that P is an n-dimensional 
submanifold of M. 

We prove (ii). The inclusion 1: P > M is obviously smooth and in- 
jective (it extends to the identity map on R*). Moreover, TpP C TpM for 
every p € P and the derivative di(p) : T,P — T,M is the obvious inclusion 
for every p € P. That v is proper follows immediately from the definition. 
Hence ¢ is an embedding. 

We prove (iii). If a coordinate chart @o as in (iii) exists, then the 
set Up N P is P-open and is diffeomorphic to an open subset of R”. Since the 
point po € P was chosen arbitrary this proves that P is an n-dimensional 
submanifold of M. Conversely, suppose that P is an n-dimensional sub- 
manifold of M and let pgp € P. Choose any coordinate chart @9 : Up — R™ 
of M defined on an M-open neighborhood Up C M of po. Then ¢9(Uo N P) 
is an n-dimensional submanifold of R™. Hence Theorem 2.1.10 asserts 
that there are open sets V,W CR” with po € V C do(Uo) and a diffeo- 
morphism ~:V — W such that 


bo(po) € V, W(V 9 go(Uo 1 P)) = WN (R® x {0}). 


Now define U := by (V) C Up. Then pp € U, the chart do restricts to a 
diffeomorphism from U to V, the composition ¢ := wo ¢do|y : U > W isa 
diffeomorphism, and ¢(UN P) = ¥(VN do(Uo Nn P)) =W N(R” x {O}). 
We prove (iv). That the condition is sufficient follows directly from 
Theorem 2.2.19. To prove that it is necessary, assume that PC M, is a 
submanifold of dimension n, fix an element po € P, and choose a coordi- 
nate chart 6: U — R™ on an M-open neighborhood U C M of po as in 
part (iii). Define the map g: U — R™™” by g(p) := (¢@n4i(p),---;@m(p)) 
for p€ U. Then 0 is a regular value of g and g~!(0) =UNP. This proves 
Theorem 2.3.4. 
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Example 2.3.6. Let S' C R? = C be the unit circle and consider the map 
f : S! — R? given by f(a, y) := (a, ry). This map is a proper immersion but 
is not injective (the points (0,1) and (0,—1) have the same image under f). 
The image f(S') is a figure 8 in R? and is not a submanifold (Figure 2.7). 


Figure 2.7: A proper immersion. 


Example 2.3.7. Consider the restriction of the map f in Example 2.3.6 to 
the submanifold N := S! \ {(0,—-1)}. The resulting map f : N > R? is an 
injective immersion but it is not proper. It has the same image as before 
and hence f(NV) is not a manifold. 

Example 2.3.8. The map f : R > R? given by f(t) := (¢?,t?) is proper 
and injective, but is not an embedding (its derivatuve at ¢ = 0 is not in- 
jective). The image of f is the set f(IR) =C := {(x,y) € R?|23 = y”} (see 
Figure 2.8) and is not a submanifold. (Prove this!) 


Figure 2.8: A proper injection. 


Example 2.3.9. Define the map f : R > R? by f(t) := (cos(t), sin(t)). This 
map is an immersion, but it is neither injective nor proper. However, its 
image is the unit circle in R? and hence is a submanifold of R?. The 
map R— R?:t+> f(t?) is not an immersion and is neither injective nor 
proper, but its image is still the unit circle. 
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2.4 Vector Fields and Flows 


2.4.1 Vector Fields 


Definition 2.4.1 (Vector field). Let M C R* be a smooth m-manifold. A 
(smooth) vector field on M is a smooth map X : M — R* such that 


X(p) € T,M 
for every p€ M. The set of smooth vector fields on M will be denoted by 
Vect(M) := {x : M > R*|X is smooth, X(p) € T,M for all p € u} : 


Exercise 2.4.2. Prove that the set of smooth vector fields on M is a real 
vector space. 


Example 2.4.3. Denote the standard cross product on R® by 


L2Y3 — L3Y2 
TX Ys= T3Y1 — ©1Y3 
X1Y2 — L2Y1 


for z,y € R°. Fix a vector € € S? and define the maps X,Y : S* > R® by 
X(p):=€xp, — Y(p) = (€ x p) x p. 


These are vector fields with zeros +€. Their integral curves (see Defini- 
tion 2.4.6 below) are illustrated in Figure 2.9. 


Figure 2.9: Two vector fields on the 2-sphere. 


Example 2.4.4. Let M := R?. A vector field on M is then any smooth 
map X : R? > R?. As an example consider the vector field 


X(z,y) = (x, —y): 


This vector field has a single zero at the origin and its integral curves are 
illustrated in Figure 2.10. 
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Figure 2.10: A hyperbolic fixed point. 


Example 2.4.5. Every smooth function f : R™ — R determines a gradient 
vector field 


X=Vf:=| 072 |: R™SR™. 


of 


OLm 


Definition 2.4.6 (Integral curve). Let M C R* be a smooth m-manifold, 
let X € Vect(M) be a smooth vector field on M, and let I CR be an open 
interval. A smooth map y: I — M is called an integral curve of X iff it 
satisfies the equation 


for everyt € I. 


Theorem 2.4.7. Let M Cc R* be a smooth m-manifold and X € Vect(M) 
be a smooth vector field on M. Fix a point po € M. Then the following 
holds. 

(i) There is an open interval I C R containing 0 and a smooth curve 
y:I—M satisfying the equation 


Vt) =X(7(), (0) = Po (2.4.1) 


for everyt € I. 


(ii) If yi: th > M and 72: Ip > M are two solutions of (2.4.1) on open 
intervals I, and Iz containing 0, then 71(t) = y(t) for everyt € ON Ip. 
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(ae. 


: 


Figure 2.11: Vector fields in local coordinates. 


Proof. We prove (i). Let ¢9 : Up > R™ be a coordinate chart on M, defined 
on an M-open neighborhood Up C M of po. The image of ¢p is an open set 
Q := ¢9(Uo) C R™ and we denote the inverse map by wo := bo :Q3 M 
(see Figure 2.11). Then, by Theorem 2.2.3, the derivative dio(x) : R™ > R* 
is injective and its image is the tangent space Ty,(,)M for every x € 2. 
Define f : Q 4 R™ by f(x) := dyo(x)1X(Wo(z)) for c €Q. This map 
is smooth and hence, by the basic existence and uniqueness theorem for 
ordinary differential equations in R™ (see [63]), the equation 


&(t) = f(w(t)), — &(0) = 0 == do(po), (2.4.2) 


has a solution « : J > Q on some open interval J C R containing 0. Hence 
the function y := wp oa: I — Ug C M is asmooth solution of (2.4.1). This 
proves (i). a 

The local uniqueness theorem asserts that two solutions 7; : [; ~ M 
of (2.4.1) for 1 = 1,2 agree on the interval (—e,e) C 1 MN Ig for e > 0 
sufficiently small. This follows immediately from the standard uniqueness 
theorem for the solutions of (2.4.2) in [63] and the fact that 7: I > Qisa 
solution of (2.4.2) if and only if y := a o2: I — Up is a solution of (2.4.1). 

To prove (ii) we observe that the set I := I, M Iz is an open interval 
containing zero and hence is connected. Now consider the set 


A:= {te I| y(t) = r2(t)}. 


This set is nonempty, because 0 € A. It is closed, relative to I, because the 
maps 71:1 — M and y2: I — M are continuous. Namely, if t; € J is a 
sequence converging to t € J, then y(t;) = ye(ti) for every i and, taking 
the limit i + oo, we obtain 71(t) = y2(t) and hence t € A. The set A is also 
open by the local uniqueness theorem. Since J is connected it follows that 
A =I. This proves (ii) and Theorem 2.4.7. 
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2.4.2 The Flow of a Vector Field 


Definition 2.4.8 (The flow of a vector field). Let M Cc R* be a smooth 
m-manifold and X € Vect(M) be a smooth vector field on M. For po © M 
the maximal existence interval of po is the open interval 


oo) =U 


By Theorem 2.4.7 equation (2.4.1) has a solution y : I(po) + M. The flow 
of X is the map ¢6:D— M defined by 


D := {(t, po) |po € M, t € I(po)} 


and $(t, po) := y(t), where y: I(po) > M is the unique solution of (2.4.1). 


Theorem 2.4.9. Let M C R* be a smooth m-manifold and X € Vect(M) 
be a smooth vector field on M. Let 6: D— M be the flow of X. Then the 
following holds. 


(i) D is an open subset of Rx M. 
(ii) The map 6: D > M is smooth. 
(iii) Let pp € M and s € I(po). Then 


I CR is an open interval containing 0 
and there is a solution y: I > M of (2.4.1) 


I($(s, po)) = I(po) — s (2.4.3) 
and, for everyt € R with s+t € I(po), we have 
o(s +t, po) = O(t, O(5, Do). (2.4.4) 


Proof. See page 42. 


Lemma 2.4.10. Let M, X, D, ¢ be as in Theorem 2.4.9 and let K C M be 
a compact set. Then there exists an M-open set U C M and ane > 0 such 
that K CU, (-e,¢e) x U CD, and ¢ is smooth on (—e,€) x U. 


Proof. In the case where M = (2. is an open subset of R™ this was proved 
in [64, Thm 4.1.4]. Using local coordinates we deduce (as in the proof 
of Theorem 2.4.7) that, for every p € M, there exists an M-open neighbor- 
hood U, C M of pand an €p > 0 such that (—ép, €p) x Up C D and the restric- 
tion of ¢ to (—é€p, €p) X Up is smooth. Using this observation for every p € Kk 
(and the axiom of choice) we obtain an M-open cover K C Unex Up- Since 
the set K is compact there exists a finite subcover K C Up, U--:UUpy =: U. 
Now define € := min{ép,,...,€p, } to deduce that (—e,e) x U C D and ¢ is 
smooth on (—e,¢) x U. This proves Lemma 2.4.10. 
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Proof of Theorem 2.4.9. We prove (iii). The map 7 : I(po)—s — M defined 
by y(t) := @(s+t, po) is a solution of the initial value problem +(t) = X (7(t)) 
with 7(0) = $(s,po). Hence I(po) — s C I(¢(s,po)) and equation (2.4.4) 
holds for every t € R with s +t € I(po). In particular, with t = —s, we 
have po = $(—s, ¢(s,po)). Thus we obtain equality in equation (2.4.3) by 
the same argument with the pair (s.po) replaced by (—s,@(s,po)). 

We prove (i) and (ii). Let (to,p9) € D so that po € M and to € I (po). 
Suppose t) > 0. Then K := {¢(t,po)|0<t< to} is a compact subset 
of M. (It is the image of the compact interval [0,to] under the unique 
solution y:I(po) > M of (2.4.1).) Hence, by Lemma 2.4.10, there is an 
M-open set U C M and ane > 0 such that 


K CU, (-e,e)x UCD, 


and ¢ is smooth on (—e,¢) x U. Choose N so large that to/N < ¢. Define 
Uo := U and, for k = 1,...,.N, define the sets U; C M inductively by 


Ux := {p € U| b(to/N, p) € Ux_-i} . 


These sets are open in the relative topology of M. 

We prove by induction on k that (—e,kto/N +e) x Ux, C D and ¢ is 
smooth on (—e,kto/N + ¢) x U,. For k = 0 this holds by definition of ¢ 
and U. If k € {1,...,N} and the assertion holds for k — 1, then we have 


peu, => peu, $(to/N,p) € Up-1 
=> (-€,€) CI(p), (-e, (Kk — 1)to/N + €) C I(G(to/N, p)) 
=> (-, kto/N +2) C Hp). 
Here the last implication follows from (2.4.3). Moreover, for p € Ux and 


to/N —E <t < kto/N +¢, we have, by (2.4.4), that 


(t, p) = p(t = to/N, b(to/N, P)) 


Since ¢(to/N,p) € Ux_i for p € Ux the right hand side is a smooth map 
on the open set (to/N — €,kto/N +) x Ux. Since Ux C U, ¢ is also a 
smooth map on (—e,¢) x Ux, and hence on (—e,kto/N + €) x Ux. This 
completes the induction. With k = N we have found an open neighborhood 
of (to, po) contained in D, namely the set (—e, to +¢) x Uy, on which ¢ is 
smooth. The case to < 0 is treated similarly. This proves (i) and (ii) and 
Theorem 2.4.9. 


Definition 2.4.11. A vector field X € Vect(M) is called complete iff, for 
each po € M, there exists an integral curve y: R— M of X with y(0) = po. 
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Lemma 2.4.12. Let M Cc R* be a compact manifold. Then every vector 
field on M is complete. 


Proof. Let X € Vect(M). It follows from Lemma 2.4.10 with kK = M 
that there exists an ¢ > 0 such that (—e,¢) C I(p) for all p € M. By Theo- 
rem 2.4.9 this implies [(p) = R for all p € M. Hence X is complete. 


Let M C R* be a smooth manifold and X € Vect(M). Then 
X iscomplette <= I(p)=RVpEM <= D=RxM. 


Assume X is complete, let 6: R x M — M be the flow of X, and define 
the map ¢' : M > M by ¢'(p) := $(t,p) fort € R and p € M. Then 
Theorem 2.4.9 asserts that ¢° is smooth for every t € R and that 

ptt = go f*, g° =id (2.4.5) 
for all s,t € R. In particular this implies that ¢'0 ¢' = ¢'to d! = id. 
Hence ¢° is bijective and (¢°)~' = ¢~, so each ¢! is a diffeomorphism. 
Exercise 2.4.13. Let M C R* be asmooth manifold. A vector field X on M 
is said to have compact support iff there exists a compact subset K C M 


such that X(p) = 0 for every p € M \ K. Prove that every vector field with 
compact support is complete. 


We close this subsection with an important observation about incomplete 
vector fields. The lemma asserts that an integral curve on a finite existence 
interval must leave every compact subset of M. 


Lemma 2.4.14. Let M C R* be a smooth m-manifold, let X € Vect(M), 
let@¢:D— M be the flow of X, let K C M be a compact set, and let pp © M 
be an element such that 


I(po) N (0,00) = [0, ), bb ae. 
Then there exists a number 0 < tr < b such that 
th <t<b => b(t,po) € M\ K 


Proof. By Lemma 2.4.10 there exists an ¢ > 0 such that (—e,¢) C I(p) for 
every p € K. Choose € so small that « < b and define 


ty :=b-—e>Q0. 


Choose a real number ty <t <b. Then I(@(t,p0)) 9 [0,00) = [0,b — t) by 
equation (2.4.3) in part (ii) of Theorem 2.4.9. Since 0 < b—t<b—tK =e, 
this shows that (—e,¢) ¢ I(@(t,po)) and hence ¢(t,po) ¢ K. This proves 
Lemma 2.4.14. 
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The next corollary is an immediate consequence of Lemma 2.4.14. In 
this formulation the result will be used in §4.6 and in §7.3. 
Corollary 2.4.15. Let M C R* be a smooth m-manifold, let X € Vect(M), 
and let y : (0,1) > M be an integral curve of X. If there exists a compact 
set K C M that contains the image of y, then y extends to an integral curve 
of X on the interval (—p,T + p) for some p> 0. 
Proof. Here is another more direct proof that does not rely on Lemma 2.4.10. 
Since K is compact, there exists a constant c > 0 such that |X(p)| < c for 
all p € K. Since y(t € Kk for 0 <t<T, this implies 


v(t) - r(s)| = < fie Nhdr= fixe ))| dr < c(t — 5) 


for0 <s<t<T. Thus the limit po := lim,\o 7(t) exists in R* and, since K 
is a closed subset of R*, we have po € K C M. Define yo : (0,1) + M by 


J Po, for t = 0, 
Yolt) = { i), torl< 2 <7, 


We prove that 7 is differentiable at t = 0 and 40(0) = X (po). To see this, fix 
a constant ¢ > 0. Since the curve [0,T) + R* : t+ X(7/(t)) is continuous, 
there exists a constant 6 > 0 such that 


0<t<6 == —|X(o(t)) — X(po)| <. 


r)dr| < 


Hence, for 0 <5 <t<6, we have 


ly(t) — y(s) — (— 8) X (po) = 


<f IX((r)) — X (po) ar 
(t — sje. 
Take the limit s — 0 to obtain 
y(t) — p 
WO PO _ x(p9) 


for 0<t<06. Thus 7 is differentiable at t = 0 with 40(0) = X(po), as 
claimed. Hence 7 extends to an integral curve 7: (—p,T) > M of X for 
some p > 0 via 7(t) := o(t, po) for —p < t < Oand Y(t) := y(t) for0 << t < T. 
Here ¢ is the flow of X. That y also extends beyond t = T,, follows by re- 
placing 7(t) with y([—t) and X with —X. This proves Corollary 2.4.15. 


ly(t) — y(s) — (= 8) X(po)| 
t—s 


= lim <e 
s—0 
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The Group of Diffeomorphisms 
Let us denote the space of diffeomorphisms of M by 
Diff(M) := {¢: M > M| ¢ is a diffeomorphism} . 
This is a group. The group operation is composition and the neutral element 


is the identity. Now equation (2.4.5) asserts that the flow of a complete 
vector field X € Vect(M) is a group homomorphism 


R — Diff(M) : tw ¢*. 


This homomorphism is smooth and is characterized by the equation 
dy t 0 
Hu? P) =X), oF (@) =P 
for allp € M andt € R. We will often abbreviate this equation in the form 
oo =Xo¢d', ¢° = id. (2.4.6) 


Exercise 2.4.16 (Isotopy). Let M C R* bea compact manifold and J C R 
be an open interval containing 0. Let 


Ix M>R*: (t,p) 1 Xi(p) 


be a smooth map such that X; € Vect(V/) for every t. Prove that there is 
a smooth family of diffeomorphisms I x M — M : (t,p) +> ¢(p) satisfying 


d : 
ue = X10 $1, go = id (2.4.7) 
for every t € I. Such a family of diffeomorphisms 
I > Diff(M): th & 


is called an isotopy of MM. Conversely prove that every smooth isotopy 
I > Diff(M) : t+ ¢; is generated (uniquely) by a smooth family of vector 
fields I + Vect(M): tr X}. 
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2.4.3 The Lie Bracket 


Let Mc R* and N C R* be smooth m-manifolds and X € Vect(M) be 
smooth vector field on M. If w : N —> M is a diffeomorphism, then the 
pullback of X under yw is the vector field on N defined by 


(b* X)(q) = dv(q) | X(¥(q)) (2.4.8) 


for qe N. If 6: M > N is a diffeomorphism, then the pushforward of X 
under w is the vector field on N defined by 


(6.X)(q) = dd(e*(q))X(¢ *(q)) (2.4.9) 
forge N. 


Lemma 2.4.17. Let M Cc R*, NCR‘, and PC R® be smooth m-dimen- 
sional submanifolds and let X € Vect(M) and Z € Vect(P). Then 


o.X = (Pp 1)*X (2.4.10) 
and 
(bo P)sX =UbeX, (od)! Z= Gry" Z. (2.4.11) 
Proof. Equation (2.4.10) follows from the fact that 


do~'(q) = db(o"'(q))7* : TyN + Ty-1(q)M 


for all g € N (Corollary 2.2.15) and the equations in (2.4.11) follow directly 


from the chain rule (Theorem 2.2.14). This proves Lemma 2.4.17. 


We think of a vector field on M as a smooth map 
X:M>R 


that satisfies the condition X(p) € T,M for every p € M. Ignoring this 
condition temporarily, we can differentiate X as a map from M to R* and 
its derivative at p is then a linear map 


dX (p) :T,M > R*. 


In general, this derivative will not take values in the tangent space 7M. 
However, if we have two vector fields X and Y on M, then the next lemma 
shows that the difference of the derivative of X in the direction Y and of Y 
in the direction X does take values in the tangent spaces of M. 
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Lemma 2.4.18 (Lie bracket). Let M C R* be a smooth m-manifold and 
let X,Y € Vect(M) be complete vector fields. Denote by 

R > Diff(M) : tr ¢, R > Diff(M): tury! 


the flows of X and Y, respectively. Fix a point p © M and define the smooth 
map y:R— M by 


a(t) = grogtod ov). (2.4.12) 
Then ¥(0) =0 and 


d 


dt (vt) = 57(0) 


d 


t=0 


| (@Y)) nists 


“)  ((u)*X) @) 


dtl, 
= 4X(p)¥(p)-d¥()X(p) TM. 


Exercise 2.4.19. Let y : R > R* be a C?-curve and assume ¥(0) = 0. 
Prove that $19 WAVE) = lim;-59 t?(4(t) — 7(0)) — 57(0). 


Proof of Lemma 2.4.18. Define the map 2 : R? + M by 
B(s,t) = Pr opog Pop (p) 
for s,t € R. Then 7(t) = G(t,t) and 
OB 


95 Ort) = X(P) — def" (b~"(p))X(¥"(v)), (2.4.14) 
2,0) = dp*(¢-*(p))¥ (¢ *(p)) — Y(p). (2.4.15) 


for all s,t €¢ R. Hence 


a a 
4(0) =~ ,0) + 20,0) =0. 


This implies the first equality in (2.4.13) by Exercise (2.4.19). To prove 


the remaining assertions, note that 6(s,0) = 6(0,t) = p, hence the second 
derivatives 076/0s? and 073/0t? vanish at s = t = 0, and therefore 


0’B 
=92 = 
Osdt 


4(0) (0,0). (2.4.16) 
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Combining equations (2.4.15) and (2.4.16) we find 


5400) = 5 


Likewise, combining equations (2.4.14) 


~ Os 


s= 


28 (4,0) = 


) 
9 ot 


do*(e *(p))Y (bo *(p) 


s=0 


a 
ds 


(2.4.16) we find 


In both cases the right hand side is the derivative of a smooth curve in the 
tangent space T,,M and so is itself an element of TM. Moreover, we have 


This proves Lemma 2.4.18. 


< do*(o-*(p) )¥ (6-8 (p)) 
s=0 
fa) 0 sit 1-8 
Glo tile 
ra) O st 1-8 
Mea calege 
A (X(b'(p)) — deb" (p) X(p)) 
t=0 
dX (p)Y (p) — Fl < - yt o o°(p) 
dX (p)Y (p) — 2 2 o i yt o f*(p) 
dX (p)Y (p) — < ee) 
dX (p)Y (p) — dY (p) X(p). 
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Definition 2.4.20 (Lie bracket). Let M Cc R* be a smooth manifold and 
let X,Y € Vect(M) be smooth vector fields on M. The Lie bracket of X 
and Y is the vector field [X,Y] € Vect(M) defined by 


[X, Y](p) := dX (p)Y¥ (p) — dY (p)X(p). (2.4.17) 


Warning: In the literature on differential geometry the Lie bracket of two 
vector fields is often (but not always) defined with the opposite sign. The 
rationale behind the present choice of the sign will be explained in §2.5.7. 


Lemma 2.4.21. Let MC R*® and N C R° be smooth manifolds, let X,Y, Z 
be smooth vector fields on M, and let 


o:N>M 
be a diffeomorphism. Then 
o |x. ¥ | =o Ao YI, (2.4.18) 
[X,Y] + [Y, X] =0, (2.4.19) 
[X, [Y, Z]] + [Y, [Z, X]] + [Z, [X, Y]] = 0. (2.4.20) 


The last equation is called the Jacobi identity 
Proof. Let R > Diff(M) : t+ v' be the flow of Y. Then the map 
R > Diff(N):tu gd loyod 
is the flow of the vector field ¢*Y on N. Hence, by Lemma 2.4.17 and 
Lemma 2.4.18, we have 
d 


[PxX, oY] = — 


FO row ony ox 


t=0 

d 

=| # (y" 

dlp? 
= ¢"|X,Y]. 

This proves (2.4.18). Equation (2.4.19) is obvious. To prove (2.4.20), let o! 

be the flow of X. Then by (2.4.18) and (2.4.19) and Lemma 2.4.18 we have 


xX 


Ly Z\,X] = 


This proves Lemma, 2.4.21. 
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Definition 2.4.22. A Lie algebra is a real vector space g equipped with 
a skew-symmetric bilinear map g x g > g: (€,7) > [€,7] that satisfies the 
Jacobi identity [€, [n, ¢]] + [n, [¢,€]] + [¢, [ml] = 0 for all €,n,¢ € g. 


Example 2.4.23. The Vector fields on a smooth manifold M C R* form a 
Lie algebra with the Lie bracket (2.4.17). The space gl(n, R) = R”*” of real 
n X n-matrices is a Lie algebra with the Lie bracket 


[E,n] = &n — m6. 


It is also interesting to consider subspaces of gl(n, R) that are invariant under 
this Lie bracket. An example is the space 


o(n) = . E gi(n,R)|€7 +E = of 


of skew-symmetric n x n-matrices. It is a nontrivial fact that every finite- 
dimensional Lie algebra is isomorphic to a Lie subalgebra of gl(n,R) for 
some n. For example, the cross product defines a Lie algebra structure 
on R? and the resulting Lie algebra is isomorphic to 0(3). 


Remark 2.4.24. There is a linear map R”*™ — Vect(R™) : € + X¢ which 
assigns to a matrix € € gl(m,R) the linear vector field X¢ : R™ > R™ 
given by X¢(x) := €x for « € R™. This map preserves the Lie bracket, 
ie. [X¢, X,] = Xj¢n], and hence is a Lie algebra homomorphism. 


To understand the Lie bracket geometrically, consider again the curve 
y(t) = prop og’ op "(p) 


in Lemma 2.4.18, where ¢° and y" are the flows of the vector fields X and Y, 
respectively. Since +(0) = 0, Exercise 2.4.19 asserts that 


[X, Y](p) = x00) ~ “ ey gVropiog Mod MQ). (2.4.21) 


Geometrically this means that by following first the backward flow of Y 
for time ¢, then the backward flow of X for time ¢, then the forward flow 
of Y for time ¢, and finally the forward flow of X for time ¢, we will not, in 
general, get back to the original point p where we started but approximately 
obtain an “error” e?[X,Y](p). An example of this (which we learned from 


Donaldson) is the mathematical formulation of parking a car. 
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Example 2.4.25 (Parking a car). The configuration space for driving a 
car in the plane is the manifold M := C x S', where S' C C denotes the 
unit circle. Thus a point in M is a pair p = (z,A) € C x C with |A| = 1. The 
point z € C represents the position of the car and the unit vector \ € S! 
represents the direction in which it is pointing. The left turn is represented 
by a vector field X and the right turn by a vector field Y on M. These 
vector field are given by X(z,A) := (A,iA) and Y(z,A) := (A, —iA). Their 
Lie bracket is the vector field [X,Y](z,A) = (—2i\,0). This vector field 
represents a sideways move of the car to the right. And a sideways move 
by 2e? can be achieved by following a backward right turn for time ¢, then 
a backward left turn for time ¢, then a forward right turn for time ¢, and 
finally a forward left turn for time e. 

This example can be reformulated by identifying C with R? via z = x+iy 
and representing a point in the unit circle by the angle 6 € R/27Z via 
= e!®. In this formulation the manifold is M = R? x R/27Z, a point in M 
is represented by a triple (x, y,0) € R°, the vector fields X and Y are 


X (x,y, 0) := (cos(@), sin(@), 1), Y(z,y,@) := (cos(0), sin(@), —1), 
and their Lie bracket is [X,Y ](x, y, 0) = 2(sin(0), — cos(@), 0). 


Lemma 2.4.26. Let X,Y © Vect(M) be complete vector fields on a man- 
ifold M and $',' € Diff(M) be the flows of X and Y, respectively. Then 
the Lie bracket |X, Y] vanishes if and only if the flows of X and Y commute, 
i.e. Pow =y'o @’ for all s,t ER. 


Proof. If the flows of X and Y commute, then the Lie bracket [X, Y] vanishes 
by Lemma 2.4.18. Conversely, suppose that [X,Y] = 0. Then we have 


d 8 = s d r = s = 
for every s € R and hence 
(¢°)4¥ = Y. (2.4.22) 


Fix a real number s and define the curve y: R— M by 4(t) := ¢°(v"(p)) 
fort € R. Then 7(0) = ¢*(p) and 


9(t) = dd*(w"(p))Y (H"(v)) = 8") Y) (9) = YO) 


for all t. Here the last equation follows from (2.4.22). Since 7 is the flow of 
Y we obtain y(t) = ¥'(%(p)) for all t € R and this proves Lemma 2.4.26. 


Exercise 2.4.27. In the situation of Lemma 2.4.26 prove that {¢! ow ier 
is the flow of the vector field X + Y. 
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2.5 Lie Groups 


Combining the concept of a group and a manifold, it is interesting to consider 
groups which are also manifolds and have the property that the group op- 
eration and the inverse define smooth maps. We shall only consider groups 
of matrices. 


2.5.1 Definition and Examples 


Definition 2.5.1 (Lie group). A nonempty subset G C R"*” is called a 
Lie group iff it is a submanifold of R"“”" and a subgroup of GL(n,R), i.e. 


(where gh denotes the product of the matrices g and h) and 
gEG => det(g) 40 andg' €G. 


(Since G # it follows from these conditions that the identity matrix 1 is 
an element of G.) 


Example 2.5.2. The general linear group G = GL(n,R) is an open subset 
of R”*” and hence is a Lie group. By Exercise 2.1.19 the special linear group 


SL(n, R) = {g € GL(n, R)| det(g) = 1} 
is a Lie group and, by Example 2.1.20, the special orthogonal group 
SO(n) = {9 € GL(n,R)| g'g = 1, det(g) = i} 


is a Lie group. In fact every orthogonal matrix has determinant +1 and 
so SO(n) is an open subset of O(n) (in the relative topology). 

In a similar vein the group GL(n, C) := {g € C”*" | det(g) 4 0} of com- 
plex matrices with nonzero (complex) determinant is an open subset of C”*” 
and hence is a Lie group. As in the real case, the subgroups 


SL(n, C) := {g € GL(n, C) | det(g) = 1}, 
U(n) = {g € GL(n,C)|g°g = 1}, 
SU(n) := {g € GL(n, C)|g"g = 1, det(g) = 1} 
aT 


are submanifolds of GL(n,C) and hence are Lie groups. Here g* := g 
denotes the conjugate transpose of a complex matrix. 


Exercise 2.5.3. Prove that SL(n,C), U(n), and SU(n) are Lie groups. 
Prove that SO(n) is connected and that O(n) has two connected components. 
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Exercise 2.5.4. Prove that GL(n, C) can be identified with the group 
—1 
G:= {8 € GL(2n,R)|6y = Jd}, gx ( ; 5 ) . 


Hint: Use the isomorphism R” x R” > C”: (x,y) + «+ iy. Show that a 
matrix ® € R?”*?" commutes with Jo if and only if it has the form 


_ xX —-Y nxn 
o=(} ee Xen 


What is the relation between the real determinant of ® and the complex 
determinant of X + iY? 


Exercise 2.5.5. Let Jo be as in Exercise 2.5.4 and define 

Sp(2n) = {wv € GL(2n,R)| UT Sov = Jo} 
This is the symplectic linear group. Prove that Sp(2n) is a Lie group. 
Hint: See [49, Lemma 1.1.12]. 


Example 2.5.6 (Unit quaternions). The quaternions form a four- 
dimensional associative unital algebra H, equipped with a basis 1,i,j,k. 
The elements of HI are vectors of the form 


=x + ix, +jro+kzx3 0,21, 22,23 ER. (2.5.1) 


The product structure is the bilinear map H x H > H: (2, y) > xy, deter- 
mined by the relations 


?P=jp=kK=-1, ij=-ji=k, jk=—-kj=i, ki=-—ik=j. 


This product structure is associative but not commutative. The quaternions 
are equipped with an involution H > H: x + 2, which assigns to a quater- 
nion x of the form (2.5.1) its conjugate Z := xp — ix, — jrg —kz3. This 
involution satisfies the conditions 


EyY==L+9, iy = 92, 2k =|z)*, |zy| = |x| |y| 


for x,y € H, where |z| := 4/ te + «3 + 73 + x} denotes the Euclidean norm 
of the quaternion (2.5.1). Thus the unit quaternions form a group 


~ $p(1) := {a € HI |x| = 1} 


with the inverse map z+> &. Note that the group Sp(1) is diffeomorphic 
to the 3-sphere S* C R* under the isomorphism H ~ R*. Warning: The 
unit quaternions (a compact Lie group) are not to be confused with the 
symplectic linear group in Exercise 2.5.5 (a noncompact Lie group) despite 
the similarity in notation. a 
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Let G C GL(n, R) be a Lie group. Then the maps 
GxG>G:(g,h) 4 gh, Gs3CGrgeg 


are smooth (see [64]). Fixing an element h € G we find that the derivative 
of the map G > G: g gh at g € G is given by the linear map 


T= 1 1Gs Gi oh (2.5.2) 


Here g and h are both matrices in R"*” and gh denotes the matrix prod- 
uct. In fact, if g € T,G, then, since G is a manifold, there exists a smooth 
curve y: R > G with 7(0) = g and 4(0) = g. Since G is a group we obtain 
a smooth curve 6 : R > G given by {(t) := y(t)h. It satisfies (0) = gh and 
so gh = B(0) € TynG. 

The linear map (2.5.2) is obviously a vector space isomorphism whose 


inverse is given by right multiplication with h~!. It is sometimes convenient 
to define the map Rp, : G > G by 


for g € G (right multiplication by h). This is a diffeomorphism and the linear 
map (2.5.2) is the derivative of R;, at g, so 


dRr(g)g = gh for g © 7,G. 


Similarly, each element g € G determines a diffeomorphism L, : G — G, 
given by 
Lg(h) := gh 

for h € G (left multiplication by g). Its derivative at h € G is again given by 
matrix multiplication, i.e. the linear map dLg(h) : T,G — Ty,G is given by 

dLg(hyh =gh for h € T),G. (2.5.3) 
Since Lg is a diffeomorphism its derivative dL,(h) : T,G > Ty,G is again a 
vector space isomorphism for every h € G. 
Exercise 2.5.7. Prove that the map G > G: gt g7! is a diffeomorphism 


and that its derivative at g € G is the vector space isomorphism 


TygG > T,1G:ur% =o vg 


Hint: Use [64] or any textbook on first year analysis. 
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2.5.2 The Lie Algebra of a Lie Group 


Let 
G c GL(n, R) 


be a Lie group. Its tangent space at the identity matrix 1 € G is called the 
Lie algebra of G and will be denoted by 


g= Lie(G) — TyG. 


This terminology is justified by the fact that g is in fact a Lie algebra, i.e. 
it is invariant under the standard Lie bracket operation 


[E, n] = &n — n€ 


on the space R"*” of square matrices (see Lemma 2.5.9 below). The proof 
requires the notion of the exponential matrix. For € € R"“" andté€R 
we define 

oo thek 

exp(t&) := -: (2.5.4) 

k=0 
A standard result in first year analysis asserts that this series converges 
absolutely (and uniformly on compact t-intervals), that the map 


R—- RR" : t+ exp(té) 


is smooth and satisfies the differential equation 


© exp(té) = &explté) = explté)é, (2.5.5) 
and that 
exp((s+ 1)€) =exp(sé)exp(té),  exp(0€)= 1 (2.5.6) 


for all s,t € R. This shows that the matrix exp(té) is invertible for each t 
and that the map R + GL(n, R) : t+ exp(t€) is a group homomorphism. 
Exercise 2.5.8. Prove the following analogue of (2.4.12). For €,7 € g 


7 - exp(V#§) exp(v'tn) exp(—vt€) exp(—vtn) = [E,n] (2.5.7) 


In other words, the infinitesimal Lie group commutator is the matrix com- 
mutator. (Compare Equations (2.5.7) and (2.4.21).) 
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Lemma 2.5.9. Let G C GL(n,R) be a Lie group and denote by g := Lie(G) 
its Lie algebra. Then the following holds. 


(i) If € € g, then exp(t&) € G for everyt ER. 

(ii) Ifg € G andn€ gq, then gng! € g. 

(iii) If, € g, then [€,n] = &n—n€ € g. 

Proof. We prove (i). For every g € G we have a vector space isomor- 


phism g = 7)G > T,G: €+ €g as in (2.5.2). Hence each element € € g 
determines a vector field X¢ € Vect(G), defined by 


Xze(g) = €9 € TG, g EG. (2.5.8) 


By Theorem 2.4.7 there is an integral curve y : (—é,¢) > G satisfying 


V(t) = Xe(7(t)) =r), (0) = 1. 


By (2.5.5), the curve (—e,¢) > R"*” : t+ exp(t€) satisfies the same initial 
value problem and hence, by uniqueness, we have exp(t€) = y(t) € G for 
all t € R with |t| <e. Now let t € R and choose N €N such that || <e. 
Then exp(7&) € G and hence it follows from (2.5.6) that 


t \N 
exp(t€) = exp (+8) EG. 


This proves (i). 
We prove (ii). Consider the smooth curve y : R > R"*” defined by 


y(t) = gexp(tn)g*. 


By (i) we have y(t) € G for every t € R. Since 7(0) = 1 we have 


This proves (ii). 
We prove (iii). Define the smooth map 7, : R > R"*” by 


n(t) := exp(t€)n exp(—t€). 


By (i) we have exp(t€) € G and, by (ii), we have y(t) € g for every t € R. 
Hence [£,7] = 7(0) € g. This proves (iii) and Lemma 2.5.9. 


By Lemma 2.5.9 the curve 7 : R > G defined by y(t) := exp(t&)g is the 
integral curve of the vector field _X¢ in (2.5.8) with initial condition (0) = g. 
Thus X¢ is complete for every € € g. 
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Lemma 2.5.10. If€ € g andy: R—- G is a smooth curve satisfying 
vstoHn=vs)v@), yO=1, 0) =€, (2.5.9) 
then y(t) = exp(té) for everyt € R. 


Proof. For every t € R we have 


. d d ; 
=) vetn=4) ys =1O =H. 
S| s—0 $}s—0 
Hence ¥ is the integral curve of the vector field X¢ in (2.5.8) with y(0) = 1. 
This implies y(t) = exp(t&) for every t € R, as claimed. 


Example 2.5.11. Since the general linear group GL(n, R) is an open subset 
of R”*” its Lie algebra is the space of all real n x n-matrices 


gl(n, IR) := Lie(GL(n, R)) = R””. 
The Lie algebra of the special linear group is 
sl(n,R) := Lie(SL(n,R)) = {€ € gl(n, R) | trace(€) = 0} 


(see Exercise 2.2.9) and the Lie algebra of the special orthogonal group is 


50(n) := Lie(SO(n)) = {é E gl(n,R)|€7 +€ = o} = o(n) 


(see Example 2.2.10). 


Exercise 2.5.12. Prove that the Lie algebras of the general linear group 
over C, the special linear group over C, the unitary group, and the special 
unitary group are given by 


gl(n,C) : 
s(n, C) := Lie(SL(n,C)) = {€ € gl(n, C) | trace(é) = 0} , 
u(n) := Lie(U(n)) = {€ € gl(n, R) | &* +&=0}, 
su(n) := Lie(SU(n)) = {€ € gl(n,C) | €* +€ =0, trace(€) = 0}. 


These are vector spaces over the reals. Determine their real dimensions. 
Which of these are also complex vector spaces? 


Remark 2.5.13. Let G C GL(n,R) be a subgroup. In Theorem 2.5.26 
below it is shown that G is a Lie group if and only if it is a closed subset 
of GL(n,R) in the relative topology. This observation can be used in many 
of the examples and exercises of the present section. 


= Lie(GL(n,C)) = C’*", 
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Exercise 2.5.14. Let V be a finite-dimensional vector space. Prove that 
the vector space g := V x End(V) is a Lie algebra with the Lie bracket 


[(u, A), (v, B)] := (Av — Bu, AB — BA) (2.5.10) 


for u,v € V and A,B € End(V). Find the corresponding Lie group. Find 
an embedding of g into End(R x V) as a Lie subalgebra. 


Exercise 2.5.15. Let (V,w) be a 2n-dimensional symplectic vector space, 
so w:V x V +R is a nondegenerate skew-symmetric bilinear form. The 
Heisenberg algebra of (V,w) is the Lie algebra h := V x R with the Lie 
bracket of two elements (v,t), (v’,t’) € V x R defined by 


[(v, ¢), (v’, #)] -= (0,w(v, v’)). (2.5.11) 


Find a corresponding Lie group structure on H=V x R. Embed H as a 
Lie subgroup into GL(n + 2, R) and find a formula for the exponential map. 
Hint: Take V = R” x R” and w((z,y),(2’,y')) = (x, y’) — (y,2’) and 
define (x, y,t)- (a, yt) :=(w@+a',yt+y,t+t + (z,y’)). 


2.5.3 Lie Group Homomorphisms 


Let G,H be Lie groups and g,h be Lie algebras. A Lie group homomor- 
phism from G to H is a smooth map p: G > H that is a group homomor- 
phism. A Lie group isomorphism is a bijective Lie group homomorphism 
whose inverse is also a Lie group homomorphism. A Lie group automor- 
phism is a Lie group isomorphism from a Lie group to itself. A Lie algebra 
homomorphism from g to h is a linear map ® : g > that preserves the 
Lie bracket. A Lie algebra isomorphism is a bijective Lie algebra homo- 
morphism. A Lie algebra automorphism is a Lie algebra isomorphism 
from a Lie algebra to itself. 


Lemma 2.5.16. Let G and H be Lie groups and denote their Lie algebras 
by g:= Lie(G) and h := Lie(H). Let p: G— H be a Lie group homomor- 
phism and denote its derivative at 1€ G by 


p:=dp(1):g-> 5. 


Then p is a Lie algebra homomorphism. Moreover, 


p(exp(€)) = exp(A(§)), - p(g€97*) = p(g)A(E)(g)* 
for all€ Eg and allg EG. 
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Proof. The proof has three steps. 

Step 1. For all€ € g andt € R we have p(exp(t&)) = exp(tp(€)). 

Fix an element € € g. Then exp(t&) € G for every t € R by Lemma 2.5.9. 
Thus we can define a curve y : R > H by y(t) := p(exp(t&)). Since p is 


smooth, this is a smooth curve in H and, since p is a group homomorphism 
and the exponential map satisfies (2.5.6), our curve y satisfies the conditions 


Vstt)= s)y,  10)=1, (0) = dp(M)é = pté). 
Hence 7(t) = exp(to(€)) by Lemma 2.5.10. This proves Step 1. 
Step 2. For all g € G and € g we have p(gng—) = p(g)A(n)p(g) 


Define the smooth curve 7 : R > G by y(t) := gexp(tn)g~'. It takes values 
in G by Lemma 2.5.9. By Step 1 we have 
= 


-1 


= p(9) exp(ta(n))e(g) | 


1 


p(y(t)) = p(g)e(exp(tn)) (9) 
for every t. Since 7(0) = 1 and 4(0) = gng7 
p(gng-*) = dp(7(0))¥(0) 


si, P0) 


“ = p(g) exp(te(n))e(g) 


= p(g)P(n)p(g) 


we obtain 


-1 


-1 


This proves Step 2. 


Step 3. For all £,n € g we have p([é,n)) = [a(€),(n)]- 


Define the curve 7: R > g by n(t) := exp(té)n exp(—t&) for t € R. It takes 
values in the Lie algebra of G by Lemma 2.5.9 and 7(0) = [&, 7]. Hence 


P(E, nl) = =|  p(exp(té)n exp(—té)) 


=—|  p(exp(té)) 6(n)p (exp(—té)) 


err (tA(§)) (n) exp (—tA(E)) 


[A(§), A(n)] - 
Here the first equality follows from the fact that p is linear, the second 


equality follows from Step 2 with g = exp(t€), and the third equality follows 
from Step 1. This proves Step 3 and Lemma 2.5.16. 
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Exercise 2.5.17. A Lie group homomorphism p : G — H is uniquely deter- 
mined by the Lie algebra homomorphism fp whenever G is connected. Hint: 
If p1,2:G— H are Lie group homomorphisms such that 6; = 62, prove 
that the set A := {g € G| pi(g) = p2(g)} is both open and closed. 


Exercise 2.5.18. If 9: G—H is a bijective Lie group homomorphism, 
then p~! : H > G is smooth and hence p is a Lie group isomorphism. Hint: 
Use Lemma 2.5.16 to prove that 6: g > h is injective. If 6 is not surjective, 
show that p has no regular value in contradiction to Sard’s theorem. 


Example 2.5.19. The complex determinant defines a Lie group homomor- 
phism det : U(n) + $+. The associated Lie algebra homomorphism is 


trace = det : u(n) > iR = Lie(S?). 


Example 2.5.20 (Unit quaternions and SU(2)). The Lie group SU(2) 
is diffeomorphic to the 3-sphere. Every matrix in SU(2) can be written as 


a=( Loti, 2+ ir; 


2 2 2 2 
; ; =. 2.5.12 
ao bite tg = iy J Lo + LX{~ +X + 23 { 3) ) 


Here the x; are real numbers. They can be interpreted as the coordinates 
of a unit quaternion 7 = x + iv; + jrg + kx3 € Sp(1) (see Example 2.5.6). 


The reader may verify that the map Sp(1) > SU(2): 2+ g in (2.5.12) isa 
Lie group isomorphism. 


Exercise 2.5.21 (The double cover of SO(3)). Identify the imaginary 
part of H with R? and write a vector € € R® = Im(H) as a purely imaginary 
quaternion € = ig; + j&; + k&3. Prove that if € ¢ Im(H) and z € Sp(1), 
then «€z € Im(H). Define the map p: Sp(1) > SO(3) by 


p(x)E = xEx 


for x € Sp(1) and € € Im(H). Prove that the linear map p(x) : R? > R? is 
represented by the 3 x 3-matrix 


oe + a — ae — ae 2(x122 — Lox) 2(x123 + Lore) 
p(x) = 2(a12%q + X0x3) x2 + x — a — oe 2(x2x%3 — L021) 
2(a123 — XoL2) 2(x2x%3 + L021) x2 + Nie — ie — we 


Show that p is a Lie group homomorphism. Find a formula for the map 
p := dp(1) : sp(1) > s0(3) 


and show that it is a Lie algebra isomorphism. For x,y € Sp(1) prove 
that p(x) = p(y) if and only if y = +2. 
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Example 2.5.22. Let g be a finite-dimensional Lie algebra. Then the set 


® is a bijective linear map and 


BE, 7] = [GE, On] (2.5.13) 
for all €£,7 € g 


Aut(g) := 4 ®:g-g 


of Lie algebra automorphisms of g is a Lie group. Its Lie algebra is the 
space of derivations on g denoted by 


6 is a linear map and 


6[E,n]=[dE,n] +[E,6n] ¢- (2.5.14) 
for all €,n € g 


Der(g) := 4 d:g—-> 9 


Now suppose that g = Lie(G) is the Lie algebra of a Lie group G. Then 
there is a map Ad: G - Aut(g) defined by 


Ad(g)n == gng"' (2.5.15) 


for g € Gand 7 € g. Part (ii) of Lemma 2.5.9 asserts that Ad(g) maps g 
to itself for every g € G. It follows directly from the definitions that the 
map Ad(g) : g > g is a Lie algebra automorphism for every g € G and that 
the map Ad: G - Aut(g) is a Lie group homomorphism. The associated 
Lie algebra homomorphism is the linear map ad : g + Der(g) defined by 


ad()n := [€, 7] (2.5.16) 


for €,n € g. To verify the equation ad = Ad we compute 


Ad(exp(t§))n = ss 


d 
Ad(§)n = = 7 


i exp(té)n exp(—t€) = [€, 7]. 

t=0 
Exercise 2.5.23. Let g be any Lie algebra. Define the map ad : g > End(g) 
by (2.5.16) and prove that the endomorphism ad(£) : g > g is a derivation 
for every € € g. Prove that ad : g > Der(g) is a Lie algebra homomorphism. 


Exercise 2.5.24. Let g be any finite-dimensional Lie algebra. Prove that 
the group Aut(g) in (2.5.13) is a Lie subgroup of GL(g) with the Lie al- 
gebra Lie(Aut(g)) = Der(g). Hint: Show that a linear map 6:g-— g is 
a derivation if and only if the linear map exp(td) : g > g is a Lie algebra 


automorphism for every t € R. Use the Closed Subgroup Theorem 2.5.26. 
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2.5.4 Closed Subgroups 


This section deals with subgroups of a Lie group G that are also submanifolds 
of G. Such subgroups are called Lie subgroups. We assume throughout 
that G C GL(n,R) is a Lie group with the Lie algebra g := Lie(G) = 7yG. 


Definition 2.5.25 (Lie subgroup). A subset H C G is called a Lie sub- 
group of G iff it is both a subgroup and a smooth submanifold of G. 


A useful general criterion is the Closed Subgroup Theorem which asserts 
that a subgroup H C G is a Lie subgroup if and only if it is a closed subset 
of G. This was first proved in 1929 by John von Neumann [54] for the special 
case G = GL(n,R) and then in 1930 by Elie Cartan [15] in full generality. 


Theorem 2.5.26 (Closed Subgroup Theorem). Let H be a subgroup 
of G. Then the following are equivalent. 


(i) H is a smooth submanifold (and hence a Lie subgroup) of G. 
(ii) H is a closed subset of G. 
It (i) holds, then the Lie algebra of H is the space 


b := {n € g| exp(tn) EH for allt € R}. (2.5.17) 


Proof of Theorem 2.5.26 (i) => (ti) and (2.5.17). Assume that H is a Lie 
subgroup of G and let § C g be defined by (2.5.17). We prove that h is 
the Lie algebra of H. Assume first that 7 € h. Then the curve y:R—G 
defined by +(t) := exp(tn) for t € R takes values in H and satisfies y(0) = 1 
and +(0) =n, and this implies 7 € Ty)H = Lie(H). Conversely, if 7 € Lie(H), 
then Lemma 2.5.9 asserts that exp(t7) € H for all t € R and hence 7 € bh. 
This shows that = Lie(H). 

Next we prove in three steps that H is a closed subset of G. Choose any 
inner product on g, denote by |-| the associated norm, and denote by ht C g 
the orthogonal complement of h with respect to this inner product. 


Step 1. There exist open neighborhoods V C H of 1 and W cht of the 
origin such that the map 6: V x W > G, defined by 


o(h, ) = hexp(§) 


forheV and& EW, is a diffeomorphism from V x W onto an open neigh- 
borhood U = 6(V x W) CG of 1. 


The derivative of the map H x h+ > G: (h, £) 4 hexp(€) at the point (1, 0) 
is bijective. Hence Step 1 follows from the Inverse Function Theorem. 
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Step 2. There exists a 6>0 such that, if €,€’€ b+ satisfy |€|,|é’| <6 
and exp(€’) exp(—€) € H, then € = €’. 

Let V,W,¢@ be as in Step 1, choose an open neigborhood V’ Cc G of 1 such 
that V'N H = V, and choose a constant 6 > 0 such that the following holds. 
(a) If € € ht satisfies |€| < 6, then € € W. 

(b) If €,é’ € g satisfy |é|, |€’| < 6, then exp(é’) exp(—€) € V’. 

Let €,€’ € h+ such that |€], |€’| < 6 and h := exp(€’) exp(—€) € H. Then we 
have €,€’ € W by (a) and hE V’/NH=YV by (b). Also o(h,€) = (1, €’) 
and so € = €’, because ¢ is injective on V x W. This proves Step 2. 


Step 3. Let h; be a sequence in H that converges to an element g € G. 
Then g € H. 


Let 6: V x W > U beas in Step 1 and let 6 > 0 be as in Step 2. Since the 
sequence iS g converges to Il, there exists an ig € N such that he g € U for 
all i > ip. Hence, for each i > io, there exists a unique pair (hj, €) «€ V x W 
such that h; lg = hi, exp(&;). This sequence satisfies lim;_,.. & = 0. Hence 
there exists an integer i; > ig such that |€;| < 6 for all 7 > 71. Since 


hih; exp(&i) = g = hgh exp(§;), 

we also have exp(€;) exp(—€) = (hihi) 'hjhi, € H for all i,j > i1. By Step 3, 

this implies €; = €; for all 7,7 > 7,. Hence & = limj_,.. €; = 0 for all i > iy 
and so g = hh’, € H. This proves Step 3. 

By Step 3 the Lie subgroup H is a closed subset of G. Thus we have 

proved that (i) implies (ii) and (2.5.17) in Theorem 2.5.26. 


The proof of the converse implication requires three preparatory lemmas. 
Lemma 2.5.27. Let € € g and lety: R— G be a curve that is differentiable 
at t = 0 and satisfies y(0) = 1 and ¥(0) = €. Then 

exp(t€) = lim (t/k)* (2.5.18) 
k- 00 
for everyt ER. 
Proof. Fix a nonzero real number ¢ and define €, := k(9(t/k) = 1) E Rn 
for k € N. Then 
at Aten VO) etree 
hr an) i ii 


and hence 


k 
exp(t€) = jim (a + 1) a jim y(t/k)*. 


(See [64, Satz 1.5.2].) This proves Lemma 2.5.27. 
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Lemma 2.5.28. Let H C G be a closed subgroup. Then the set h in (2.5.17) 
is a Lie subalgebra of g 


Proof. Let €,7 € h and define the curve y : R > H by 
y(t) := exp(t§) exp(tn) 


for t€ R. This curve is smooth and satisfies y(0) = 1 and 4(0) =€+7. 
Since H is closed, it follows from Lemma 2.5.27 that 


exp(t(€ +) = Jim 7(t/k)* € H 


for allt € R and so +7 € § by definition. Thus § is a linear subspace of g. 
Now fix an element € € §. If h € H, then 


exp(sh7'¢h) = h7' exp(sé)h € H 


for all s € R and hence h~!€h € 6 by definition. Take h = exp(tn) with n € 6 
to obtain exp(—tn)€ exp(t7) € h for all t¢R. Differentiating this curve 
at t = 0 gives [€,7] € and this proves Lemma 2.5.28. 


Lemma 2.5.29. Let HC G be a closed subgroup and let h C g be the Lie 
subalgebra in (2.5.17). Let € € g, let (&)sen be a sequence in g, and let (7;)ic 


be a sequence of positive real numbers such that 


exp(&) € H, & #0 


for alli CN and 


, ; io Gt 
lim 7; = 0, lim €; = 0, lim = = €. 
100 i-oo too Tj 


Then € €h. 
Proof. Fix a real number t. Then, for each i € N, there exists a unique 


integer m; € Z such that m;7; < t < (m;+1)7;. The sequence m; satisfies 


; bi 
lim m7; = t, lim m,&; = lim m;7;— = t& 


and hence 
exp(t€) = lim exp(m,€;) = lim exp(€;)"”" € H. 
1CO oo 


Thus exp(t€) € H for every t € R and so € € § by (2.5.17). This proves 
Lemma 2.5.29. 
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Proof of Theorem 2.5.26 (ii) => (i). Choose any inner product on g and 
define a Riemannian metric on G by |v| := |g~tv| for g € G and v € 7,G. 
Let H C G be a closed subgroup of G and define the set h C g by (2.5.17). 
Then 6 is a Lie subalgebra of g by Lemma 2.5.28. Define 


k := dim(b), £=dimig) > k, 
and choose a basis 71,...,7¢ of g such that the vectors 7...., 7, form a basis 
of h and n, € h+ for v > k. Let ho € H and define the map 0 : R’ > G by 
e(t},...,t%) := Ap exp(ttm + --- + t%ng) exp(t ®t ney +--+ + tone). 


Then @(0) = ho, O(R* x {0}) C H, and the derivative dO(0) : R’ > T,,G 
is bijective. Hence the inverse function theorem asserts that © restricts to a 
diffeomorphism from an open neighborhood 2 C R* of the origin to an open 
neighborhood U := @(Q) C G of ho that satisfies 


@(0) = ho, 0(QN (R* x {0})) CUNH. 


We prove the following. 


Claim. There exists an open set Qo C R* such that 
0ENMCN, O(N) N(R* x {0})) =UpNH, — Up = O(Mo). (2.5.19) 


Assume, by contradiction, that such an open set Q9 does not exist. Then 


there exists a sequence t; = (t},...,¢£) € R® such that 
limt;=0, #€9\(R*x{0}),  O(t;) EH. 
Ico 
Define 
k é 
hy := ho exp (oem) EH, G:= S© tm eb" \ {0}. 
v=1 v=k+1 


Then h; exp(&;) = O(t;) € H and hence 
lim &=0, #0, — exp(&i) =h; 'O(ti) € H. 
IFoo 


Passing to a subsequence, if necessary, we may assume that the sequence 
& /|€| converges. Denote its limit by € := limi &/|&|. Then € € h by 
Lemma 2.5.29 and € € + by definition. Since |€| = 1, this is a contradiction. 
This contradiction proves the Claim. Thus there does, after all, exist an 
open set Q C R* that satisfies (2.5.19), and the map O71 : Up + Oo is then 
a coordinate chart on G which satisfies O~!(Up NH) = 2M (R* x {0}). 
Hence H is a submanifold of G and this proves Theorem 2.5.26. 
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Exercise 2.5.30. The subgroup {exp(it) | t € Q} C S$! is not closed. 


Exercise 2.5.31. Choose a nonzero vector (w1,...,Wn) € R” such that at 
least one of the ratios w;/w; is irrational. Prove that the subgroup 


S. — fier. errihes. seas gree, | te R} Cc iso? ~T 

of the torus is not closed. Similar examples exist in any Lie group that 
contains a torus of dimension at least two. 
Exercise 2.5.32. Let Go and G, be Lie subgroups of GL(n,R) with the 
Lie algebras go := Lie(Go) and g; := Lie(G;). Prove that G := Go NG; isa 
Lie subgroup of GL(n,R) with the Lie algebra g = go N gt. 
Exercise 2.5.33 (Center). The center of a group G is the subgroup 

Z(G) := {g EG | gh = hg for all he G} : (2.5.20) 
Let G Cc GL(n, R) be a Lie group. Prove that its center Z(G) is a Lie sub- 


group of G. If G is connected, prove that the Lie algebra of the center Z(G) 
is the center of the Lie algebra g = Lie(G), defined by 


Z(g) = {€ € g| [E,n] =0 for all neg}. (2.5.21) 


Hint: If G is connected, prove that an element € € g satisfies [£,7] = 0 for 
all 7 € g if and only if exp(t€)h = hexp(té) for allt € R and all hE G. 


Exercise 2.5.34. Let GC GL(n,R) be a compact Lie group with the Lie 
algebra g := Lie(G) and let € € g. Prove that the set Te := {exp(t€) | teR} 
is a closed, connected, abelian subgroup of G and deduce that it is a Lie 
subgroup of G (called the torus generated by £). 


Exercise 2.5.35. Let GC GL(n, R) be a Lie group with the Lie algebra g 
and let € : R — g be a smooth function. Prove that the differential 
equation ¥(t) = &(t)y(t), y(0) = 1, has a unique solution y : R > G. 
Hint: Prove the existence of a solution 7: R— GL(n,R) and show that 
the set {t € R| y(t) € G} is open and closed. 


Remark 2.5.36 (Malcev’s Theorem). Let G C GL(n,R) be a Lie group 
with the Lie algebra g and let h C g be a Lie subalgebra. Then the set 


h: [0,1] > G is a smooth path 
H := ¢ h(1)| such that h(0) = 1 and (2.5.22) 
h(t)h(t)~! € b for all ¢ € [0,1] 
is a subgroup of G, called the integral subgroup of §. A theorem by 
Anatolij Ivanovich Malcev [47] (see also [28, Corollary 13.4.6]) asserts that H 
is a Lie subgroup of G if and only if T;,, := {exp(t7) | t€R} CH foralln € b. 
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2.5.5 Lie Groups and Diffeomorphisms 


There is a natural correspondence between Lie groups and Lie algebras on 
the one hand and diffeomorphisms and vector fields on the other hand. We 
summarize this correspondence in the following table 


Lie groups Diffeomorphisms 
G c GL(n, R) Diff(M) 
g = Lie(G) = T7G Vect(M) = TjqDiff(M) 
exponential map flow of a vector field 
t +> exp(t€) try of = “exp(tx)” 
adjoint representation pushforward 
Er gég* X ++ bX 
Lie bracket on g Lie bracket of vector fields 
[é,7] = &n — n€ [X,Y] =dX -Y—dY -X. 


To understand the correspondence between the exponential map and the 
flow of a vector field compare equation (2.4.6) with equation (2.5.5). To un- 


derstand the correspondence between the adjoint representation and push- 
forward observe that 


iva 


ee gexp(tn)g”', 


t=0 


d 
t, 4-1 -1_ & 
eee ; ng =| 


where y' denotes the flow of Y. To understand the correspondence between 
the Lie brackets recall that 


d 


avis") Ge fe" 


a a exp(t€)7 exp(—t€), 
t=0 


t=0 


where ¢' denotes the flow of X. We emphasize that the analogy between 
Lie groups and Diffeomorphisms only works well when the manifold M is 
compact so that every vector field on M is complete. The next exercise gives 
another parallel between the Lie bracket on the Lie algebra of a Lie group 
and the Lie bracket of two vector fields. 


Exercise 2.5.37. Let G C GL(n,R) be a Lie group with Lie algebra g and 
let €,7 € g. Define the smooth curve y : R > G by 

y(t) == exp(t€) exp(tm) exp(—té) exp(—t7). 
Show that +(0) = 0 and $4(0) = [€, n] (cf. Exercise 2.5.8 and Lemma 2.4.18). 


Exercise 2.5.38. Let G C GL(n,R) be a Lie group with Lie algebra g and 
let €,7 € g. Show that [€,7] = 0 if and only if the exponential maps com- 
mute, i.e. exp(s€) exp(t7) = exp(t7) exp(s€é) = exp(s€ + ty) for all s,t E R. 
How can this observation be deduced from Lemma 2.4.26? 
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Definition 2.5.39. Let M C R* be a smooth manifold and let G C GL(n, R) 
be a Lie group. A (smooth) group action of G on M is a smooth map 


Gx M-M: (g,p) + ¢4(p) (2.5.23) 
that for each pair g,h € G satisfies the condition 
0g ° Pr = Pgh; oy = id. (2.5.24) 


If (2.5.23) is a smooth group action, then the infinitesimal action of the 
Lie algebra g := Lie(G) on M is the map g — Vect(M):€ ++ X¢ defined by 


d 
X¢(p) = dt fg Pexp(ye) (P) (2.5.25) 


forE eg andpe M. 


Exercise 2.5.40. Let (2.5.23) be a smooth group action of a Lie group G 
on a manifold M. Prove that 


Xg-lég = bg XE; Xen] = [Xe, Xn] (2.5.26) 
for all g € G and all €,7 € g = Lie(G). 


Exercise 2.5.41. Show that the maps GL(m, R) x R” > R™: (gz) 4 gz, 
SO(m+1) x S™ = 8™: (9,2) 4 gz, and Rx $1 $1: (6,z) 4 ez are 
smooth group actions. Verify the formulas in (2.5.26) in these examples. 


A smooth group action of a Lie group G on a manifold M can the thought 
of asa “Lie group homomorphism” 


G > Diff(M) : 9g + dz. (2.5.27) 


While the group Diff(J/) is infinite-dimensional, and so cannot cannot be a 
Lie group in the formal sense, it has many properties in common with Lie 
groups as explained above. For example, one can define what is meant by a 
smooth path in Diff(7) and extend formally the notion of a tangent vector 
(as the derivative of a path through a given element of Diff(Z)) to this 
setting. In particular, the tangent space of Diff(/) at the identity can then 
be identified with the space of vector fields T,qDiff(17) = Vect(M), and the 
infinitesimal action in (2.5.25) is the Lie algebra homomorphism associated 
to the “Lie group homomorphism” (2.5.27). In fact, we have chosen the sign 
in the definition of the Lie bracket of vector fields so that the map € +> X¢ 
is a Lie algebra homomorphism and not a Lie algebra anti-homomorphism. 
This will be discussed further in §2.5.7. 
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2.5.6 Smooth Maps and Algebra Homomorphisms 


Let M be a smooth submanifold of R*. Denote by ¥(M) := C™(M,R) 
the space of smooth real valued functions f: M—>R. Then ¥(M) isa 
commutative unital algebra. Each p © M determines a unital algebra ho- 
momorphism ¢, : ¥(M) — R defined by €,(f) = f(p) for pe M. 


Theorem 2.5.42. Every unital algebra homomorphism ¢ : #(M) > R has 
the form € = €, for somepe M. 


Proof. Assume that ¢ : #(M) —> R is an algebra homomorphism. 
Claim. For all f,g € ¥(M) we have e(g) =0 = e(f) € f(g7'(0)). 
Indeed, the function f — e(f)-1 lies in the kernel of ¢ and so the func- 
tion h := (f — e(f)-1)? +g? also lies in the kernel of ¢. There must be at 
least one point p € M where h(p) = 0 for otherwise 1 = e(h)e(1/h) = 0. 
For this point p we have f(p) = e(f) and g(p) = 0, hence p € g~'(0), and 
therefore e(f) = f(p) € f(g~1(0)). This proves the claim. 

The theorem asserts that there exists ap € M such that every f € ¥(M) 
satisfies e(f) = f(p). Assume, by contradiction, that this is false. Then for 
every p € M there exists a function f € #(M) such that f(p) # e(f). Re- 
place f by f —e(f) to obtain f(p) 4 0 = e(f). Now use the axiom of choice 
to obtain a family of functions f, € 4(M), one for every p€ M, such 
that fp(p) A0=e(fp) for all pe M. Then the set U, := f,'(R \ {0}) is 
an M-open neighborhood of p for every p € M. Choose a sequence of com- 
pact sets K, C M such that Ky, C inty(Kn+1) for all n and M=U,, Kn. 
Then, for each n, there is a gn € -F(M) (a finite sum of the form }>; f?,) such 
that €(g,) = 0 and g,(q) > 0 for all g € Ky. If M is compact, this is already 
a contradiction because a positive function cannot belong to the kernel of e. 
Otherwise, choose f € F(M) such that f(q) >n for all ge M\ kK, and 
alln € N. Then e(f) € f(g, 1(0)) C f(M \ Kn) C [n,0o) by the claim and 
so €(f) > n for all n. This is a contradiction and proves Theorem 2.5.42. 


Now let N be another smooth submanifold (say of R°) and let C~(M, N) 
denote the space of smooth maps from M to N. A homomorphism from 
F(N) to F(M) is a (real) linear map ®: ¥(N) > F(M) that satisfies 


(fg) =O(f)®(g), B()=1. 


Let Hom(.¥(N), #(M)) denote the space of homomorphisms from ¥(N) 
to #(M). An automorphism of the algebra ¥(M) is a bijective homo- 
morphism ® :. ¥(M) > #(M). The automorphisms of Y(M) form a group 
denoted by Aut(.¥(M)). 
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Corollary 2.5.43. The pullback operation 

C“(M, N) > Hom(¥(N),. F(M)):d4 ¢* 
is bijective. In particular, the map Diff(M) > Aut(.F(M)): @ ¢* is an 
anti-isomorphism of groups. 
Proof. This is an exercise with hint. Let ®: ¥(N) > ¥(M) be a unital 
algebra homomorphism. By Theorem 2.5.42 there exists a map ¢: M > N 


such that €, o ® = €4(p) for all p € M. Prove that fog: M — R is smooth 
for every smooth map f : N > R and deduce that ¢ is smooth. 


Remark 2.5.44. The pullback operation is functorial, i.e. 

(pod) =r od", ids = idg(m)- 
for 6 € C™®(M,N) and w € C™~(N,P). Here id denotes the identity map 
of the space indicated in the subscript. Hence Corollary 2.5.43 may be 
summarized by saying that the category of smooth manifolds and smooth 


maps is anti-isomorphic to a subcategory of the category of commutative 
unital algebras and unital algebra homomorphisms. 


Exercise 2.5.45. If MM is compact, then there is a slightly different way to 
prove Theorem 2.5.42. An ideal in (MJ) is a linear subspace Y C F(M) 
satisfying the condition f€ F(M),gEe FY = fge Y. A maximal 
ideal in ¥(M) is an ideal Y C #(M) such that every ideal Y' C F(M) 
containing Y is equal to Y. Prove that, if M is compact and Y C F(M) 
is an ideal with the property that for every p¢ M there is an f €¢ Y 
with f(p) £0, then Y = ¥(M). Deduce that each maximal ideal in ¥ (M) 
has the form Y, := {f €¢ F(M)| f(p) = 0} for some p € M. 


Exercise 2.5.46. If M is compact, give another proof of Corollary 2.5.43 
as follows. The set ®~1(_Y,) is a maximal ideal in ¥(N) for each pe M. 
Use Exercise 2.5.45 to deduce that there is a unique map ¢: M > N such 
that &-"( 7,) = A wp) for all p € M. Show that ¢ is smooth and ¢* = ®. 


Exercise 2.5.47. It is a theorem of ring theory that, when J C R is an ideal 
in aring R, the quotient ring R/T is a field if and only if the ideal J is max- 
imal. Show that the kernel of the ring homomorphism ¢€, : F(M) — R of 
Theorem 2.5.42 is the ideal Y%, of Exercise 2.5.45. Conclude that M is com- 
pact if and only if every maximal ideal Y in ¥(M) is of the form Y = ZY, 
for some p € M. Hint: The functions of compact support form an ideal. It 
can be shown that if M is not compact and Y is a maximal ideal contain- 
ing all functions of compact support, then the quotient field F(M)/Y isa 
non-Archimedean ordered field which properly contains R. 
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2.5.7 Vector Fields and Derivations 
A derivation of #(M) is a linear map 6: #(M) > F(M) that satisfies 


O(fg) = O(f)g + fo(g). 


and the derivations form a Lie algebra denoted by Der(.¥(M/)). We may 
think of Der(.F(M)) as the Lie algebra of Aut(.¥(M)) with the Lie bracket 
given by the commutator. By Theorem 2.5.42 the pullback operation 


Diff(M) > Aut(F(M)):¢4 ¢* (2.5.28) 


can be thought of as a Lie group anti-isomorphism. Differentiating it at the 
identity ¢@ = id gives a linear map 


Vect(M) > Der(.F(M)) : X + Lx. (2.5.29) 


Here the operator Ly : F¥(M) — ¥(M) is given by the derivative of a 
function f in the direction of the vector field X, i.e. 


d 
Lxfimdf-X=) fod, 
Elio 
where ¢' denotes the flow of X. Since the map (2.5.29) is the derivative 
of the “Lie group” anti-homomorphism (2.5.28) we expect it to be a Lie 
algebra anti-homomorphism. Indeed, one can show that 


Lixy) =LyLx —Lxly = —[Lx, Ly] (2.5.30) 


for X,Y € Vect(M). This confirms that our sign in the definition of the Lie 
bracket in §2.4.3 is consistent with the standard conventions in the theory 
of Lie groups. In the literature the difference between a vector field and the 
associated derivation Lx is sometimes neglected in the notation and many 
authors write Xf := df. X = Lxf, thus thinking of a vector field on a 
manifold M as an operator on the space of functions. With this notation one 
obtains the equation [X,Y]f = Y(Xf) — X(Y f) and here lies the origin for 
the use of the opposite sign for the Lie bracket in many books on differential 
geometry. 


Exercise 2.5.48. Prove that the map (2.5.29) is bijective. Hint: Fix a 
derivation 6 € Der(-¥(M)) and prove the following. Fact 1: If U Cc M is 
an open set and f € ¥(M) vanishes on U, then 6(f) vanishes on U. Fact 2: 
If p € M and the derivative df(p) : T,M — R is zero, then (d(f))(p) = 0. 
(By Fact 1, the proof of Fact 2 can be reduced to an argument in local 
coordinates. ) 


Exercise 2.5.49. Verify the formula (2.5.30). 
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2.6 Vector Bundles and Submersions 


2.6.1 Submersions 


Let M Cc R* be asmooth m-manifold and N C R¢ be a smooth n-manifold. 
A smooth map f : N > M is called a submersion iff its derivative 


df (q) : TaN => T5(q)M 
is surjective for every q € N. 


(re 7 


[ie ee =i 


ale 


M U 
Figure 2.12: A local right inverse of a submersion. 


Lemma 2.6.1. Let M Cc R* be a smooth m-manifold, N C R be a smooth 
n-manifold, and f : N + M be a smooth map. The following are equivalent. 


(i) f is a submersion. 


(ii) For every qo € N there is an M-open neighborhood U of po := f(qo) and 
a smooth map g: U — N such that g(f(qo)) =qo and fog=id:U >U. 
Thus f has a local right inverse near every point in N (see Figure 2.12). 


Proof. We prove that (i) implies (ii). Since the derivative 
df (go) : Lao. — Tp,M 

is surjective we have n > m and 
dim ker df (qo) =n —m. 


Hence there is a linear map A: R’ > R"~™ whose restriction to the kernel 
of df (qo) is bijective. Now define the map j: N > M x R"—™ by 


¥(q) = (f(g), Aa — 40) 
for g € N. Then w(qo) = (po, 0) and its derivative 


dwb(qo) : Lay N — Tp) M x R°-™ 


2.6. VECTOR BUNDLES AND SUBMERSIONS 73 


sends w € Ty,.N to (df(qo)w, Aw) and is therefore bijective. Hence it follows 
from the inverse function theorem for manifolds (Theorem 2.2.17) that there 
exists an N-open neighborhood V Cc N of qo such that the set 


W :=0(N) CMxR™™ 


is an open neighborhood of (po,0) and w#|y : V > W is a diffeomorphism. 
Let 
U:={pe M|(p,0) € W} 


and define the map g: U > N by 


Then po € U, g is smooth and 
(p,0) = H(g(p)) = (F(g(p)), A(g(P) — 40)). 
Hence f(g(p)) = p for all p € U and 
g(po) = b* (po, 0) = qo. 


This shows that (i) implies (ii). The converse is an easy consequence of the 
chain rule and is left to the reader. This proves Lemma 2.6.1 


Corollary 2.6.2. The image of a submersion f : N — M is open. 


Proof. If po = f(qo) € f(N), then the neighborhood U Cc M of po in 
Lemma 2.6.1 (ii) is contained in the image of f. 


Corollary 2.6.3. If N is a nonempty compact manifold, M is a connected 
manifold, and f : N > M is a submersion, then f is surjective and M is 
compact. 


Proof. The image f(N) is an open subset of M by Corollary 2.6.2, it is a 
relatively closed subset of I because N is compact, and it is nonempty 
because N is nonempty. Since M is connected this implies that f(N) = M. 
In particular, M is compact. 


Exercise 2.6.4. Let f : N — M be a smooth map. Prove that the 
sets {q € N | df(q) is injective} and {q € N | df(q) is surjective} are open (in 
the relative topology of NV). 
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2.6.2 Vector Bundles 


Let M Cc R* be an m-dimensional smooth manifold. 


Definition 2.6.5. A (smooth) vector bundle (over M of rank n) is a 
smooth submanifold E C M x R* of dimension m+n such that, for every 
point p € M, the set 


Ey = {ve R*|(p,0) < ES 


is an n-dimensional linear subspace of R (called the fiber of E over p). 
A vector bundle E over M is equipped with a smooth map 


rT: EO M 


defined by (p,v) := p. This map is called the canonical projection of E. 
If EC Mx R* is a vector bundle, then a (smooth) section of E is a 
smooth map s: M — R° such that s(p) € E, for every p€ M. 


A section s : M —> R° of a vector bundle E over M determines a smooth 
map 0 : M - E which sends the point p € M to the pair (p,s(p)) € EF. 
This map satisfies 7 o o = id. It is sometimes convenient to abuse notation 
and eliminate the distinction between s and o. Thus we will sometimes use 
the same letter s for the map from M to R¢ and the map from M to E. 


Definition 2.6.6. Let M Cc R* be a smooth m-manifold. The set 
LM := {(p, v) |p € M, VE T,M} 
is called the tangent bundle of M. 


The tangent bundle is a subset of M x R* and, for every p € M, its 
fiber T,M is an m-dimensional linear subspace of R* by Theorem 2.2.3. 
However, it is not immediately obvious from the definition that TM is a 
submanifold of M x R*. This will be proved below. The sections of TM are 
the vector fields on M. 


Exercise 2.6.7. Let f : M — N be a smooth map between manifolds. 
Prove that the tangent map TM > TN: (p,v) + (f(p), df(p)v) is smooth. 


Exercise 2.6.8. Let M C R* be a smooth m-manifold and let 6: U > 2 
be a smooth coordinate chart on an M-open set U C M with values in an 
open set Q = ¢(U) CR™. Prove that the map ¢: TU — Q x R™ defined 
by ¢(p, v) := (¢(p),d¢(p)) for p € U and v € T,M is a diffeomorphism. It 
is called a standard coordinate chart on TM. Deduce that TM is 
a smooth 2m-dimensional submanifold of M x R* and hence is a smooth 
vector bundle over M. (See also Corollary 2.6.12 below.) 
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Exercise 2.6.9. Let V C R® be an n-dimensional linear subspace. The 
orthogonal projection of R‘ onto V is the matrix II € R“* that satisfies 


T=Ir=', iml=v. (2.6.1) 


Prove that there is a unique matrix I € R*® satisfying (2.6.1). Prove that, 
for every symmetric matrix S = S' € R“*, the kernel of S is the orthogonal 
complement of the image of S. If D € R®"” is any injective matrix whose 
image is V, prove that det(D'D) 40 and 


T= D(D'D)""D". (2.6.2) 


Theorem 2.6.10 (Vector bundles). Let M C R* be a smooth m-manifold 
and let EC M x R° be a subset. Assume that, for every p € M, the set 


Ey := {ov ER®|(p,v) € BE} (2.6.3) 


is an n-dimensional linear subspace of R®. Let Il: M — R®? be the map 
that assigns to each p € M the orthogonal projection of R® onto Bin, 48 


I(p) =1(p)* =1(p)", im I (p) = Ep. (2.6.4) 


Then the following are equivalent. 

(i) E is a vector bundle. 

(ii) For every po € M and every vp € Ep, there is a smooth map s: M > R® 
such that s(po) = vo and s(p) € Ep for allpe M. 

(iii) The map IL: M > R°* is smooth. 

(iv) For every po € M there is an open neighborhood U Cc M of po and a 
diffeomorphism m'(U) + U x R": (p,v) 4 ®(p,v) = (p, ®p(v)) such that 
the map ®, : Ey, + R” is an isometric isomorphism for all p € U. 

(v) For every po € M there is an open neighborhood U C M of po and a 
diffeomorphism n1(U) + U x R”: (p,v) + ®(p,v) = (p,®,(v)) such that 
the map ®, : E, — R” is a vector space isomorphism for all p € U. 


Condition (i) implies that the projection x : E + M is a submersion. In (ii) 
the section s can be chosen to have compact support, i.e. there exists a 
compact subset K C M such that s(p) =0 for allpe M\ K. 


Proof. See page 77. 
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Definition 2.6.11. The maps ®: x !(U) + U x R” in Theorem 2.6.10 are 
called local trivializations of E. They fit into commutative diagrams 


nm 1(U) . U xR". 


Corollary 2.6.12. Let M C R* be a smooth m-manifold. Then TM is a 
vector bundle over M and hence is a smooth 2m-manifold in R*® x R*. 


Proof. Let 6: U + Q be a coordinate chart on an M-open set U C M with 
values in an open subset 2 C R™. Denote its inverse by = := @ 1: Q—> M. 
By Theorem 2.2.3 the linear map dy)(x) : R™ —> R* is injective and its image 
is Ty@)M for every x € 2. Hence the map D: U > R**™ defined by 


D(p) := db(d(p)) € REX™ 


is smooth and, for every p € U, the linear map D(p) : R™ — R* is injec- 
tive and its image is 7,M. Thus the function TW?” : M > R*** defined 
by (2.6.4) with E, = T,M is given by 


1" (p) = D(p) (D)"D@)) D(p)' for pe U. 


Hence II? is smooth and so T'M is a vector bundle by Theorem 2.6.10. 


Let M Cc R* be an m-manifold, N C R° be an n-manifold, f : N > M 
be a smooth map, and E Cc M x R@ be a vector bundle. The pullback 
bundle is the vector bundle f*E — N defined by 


fra { (4,0) ENxR\ve Ex} 
and the normal bundle of E is the vector bundle E+ —> M defined by 
Ets= { (p,w) EM x R4|(v,w) =0VvE Ey}. 


Corollary 2.6.13. The pullback and normal bundles are vector bundles. 


Proof. Let Il = Il” : M -+ R?¢4 be the map defined by (2.6.4). This map 
is smooth by Theorem 2.6.10. Moreover, the corresponding maps for f* FE 
and E+ are given by 


m= =F of:N3R®4, TF =1-F:M>R*4, 


These maps are smooth and hence it follows from Theorem 2.6.10 that f* 
and E+ are vector bundles. 
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Proof of Theorem 2.6.10. We first assume that FE is a vector bundle and 
prove that 7: E > M is a submersion. Let o : M — E denote the zero 
section given by o(p) := (p,0). Then roo = id and hence it follows from the 
chain rule that the derivative dr(p,0) : Tip) — TpM is surjective. Now 
it follows from Exercise 2.6.4 that for every p € M there is an ¢ > 0 such 
that the derivative dr(p,v) : Tip) — TM is surjective for every v € Ep 
with |v| < «. Consider the map f, : E — E defined by 


Sr(p, v) = (p, dv). 


This map is a diffeomorphism for every 4 > 0. It satisfies 


T= To fy 
and hence 
dr(p,v) = dr(p, Av) o dfy(p, v) : Tip») > TpM. 


Since df(p, v) is bijective and dz(p, Av) is surjective for  < ¢/ |v| it follows 
that dm(p,v) is surjective for every p € M and every v € E,. Thus the 
projection 7: E + M is a submersion for every vector bundle EF over M. 
We prove that (i) implies (ii). Let pp € M and vp € Ep . We have 
already proved that 7 is a submersion. Hence it follows from Lemma 2.6.1 


that there exists an M-open neighborhood U C M of pg and a smooth map 
o9:U>E 


such that 
moog =id:UU, 70(Po) = (Po, Vo). 


Define the map sp : U > R* by 
(p, so(p)) = o0(p) — for pe U. 
Then so(po) = vo and so(p) € Ep for all p € U. Now choose ¢ > 0 such that 
{p€M| |p—pol <e} CU 


and choose a smooth cutoff function 6 : R* — [0,1] such that 8(po) = 1 
and 6(p) = 0 for |p — po| > ¢. Define s: M > R? by 


{ B(p)so(p), if p € U, 
0, if p ZU. 


This map satisfies the requirements of (ii). 
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We prove that (ii) implies (iii). Thus we assume that F satisfies (ii). 
Choose po € M and a basis v1,...,Un of Ep . By (ii) there exists smooth 


sections s1,...,8, : M > R° of E such that s;(po) = v; fori = 1,...,n. Now 
there exists an M-open neighborhood U Cc M of po such that the vec- 
tors s1(p),...,%n(p) are linearly independent, and hence form a basis of E, 


for every p € U. Hence, for every p € U, we have 
Ey=imD(p), —-D(p) := [s1(p)--- sn(p)] € R™”. 
By Exercise 2.6.9, this implies II(p) = D(p)(D(p)'D(p))~!D(p)" for ev- 
ery p€ U. Thus every po € M has a neighborhood U such that the re- 
striction of II to U is smooth. This shows that (ii) implies (iii). 
We prove that (iii) implies (iv). Fix a point po € M and choose a ba- 
SiS U1,...,Un Of E,,. For p € M define 
D(p) := [I(p)v1 «+ TI(p)vn] € R™" 
Then D: M -> R®" is a smooth map and D(po) has rank n. Hence the set 
U :={pe M|rankD(p) =n} CM 
is an open neighborhood of pp and E, = imD(p) for all p € U. Thus 
aU) ={(p,v) € E|peU} CE 
is an open set containing 7~!(po). Define the map ® : t~!(U) > U x R” by 


B(p,0) = (p,,(0)), pv) = (DO)TD@)) DET» 


for p€ U and v € E,. This map is bijective and its inverse is given by 


®'(p, €) = (p, @'(€)), ®, '(é) = D(p) (Dip) D0) We 


for p€ U and €€ R”. Thus @ is a diffeomorphism and |®,(v)| = |v| for 
all p € U and all v € E,. This shows that (iii) implies (iv). 

That (iv) implies (v) is obvious. 

We prove that (v) implies (i). Shrinking U if necessary, we may as- 
sume that there exists a coordinate chart ¢: U > Q with values in an open 
set Q ¢ R™. Then the composition (¢ x id) o ® : m-1(U) > QO x R” isa dif- 
feomorphism. Thus E C R* x R° is a manifold of dimension m +n and this 
proves Theorem 2.6.10. 


Exercise 2.6.14. Define the notion of an isomorphism between two vector 
bundles E and F over M. Construct a vector bundle E C S! x R? of rank 1 
that does not admit a global trivialization, i.e. that is not isomorphic to the 
trivial bundle S$! x R. Such a vector bundle is called a MGbius strip. 
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The Implicit Function Theorem 


Next we carry over the Implicit Function Theorem in Corollary A.2.6 to 
smooth maps on vector bundles. 


Theorem 2.6.15 (Implicit Function Theorem). 

Let M C R* be a smooth m-manifold, let N C R® be a smooth n-manifold, 
let EC M x R® be a smooth vector bundle of rank n, let W C E be open, 
and let f: W + N be a smooth map. For p€ M define fp: Wp — N by 


W, := {v € Ey| (p,v) € WH, fale) = Foe): 


Let po € M such that 0 € W,, and dfp,(0): Tp,M —- Ty,N is bijective, 
where qo := f(po,0) € N. Then there exists a constant e > 0, open neighbor- 
hoods Up C M of pp and Vo C N of qo, and a smooth map h: Up x Vo > Ré 
such that {(p,v) € E|p € Up, |u| < e} C W and 


h(p,q)€ Ep, — |h(p, | <e (2.6.5) 
for all (p,q) € Uo x Vo and 
fp(v) =0 = v = h(p,q) (2.6.6) 


for all (p,q) € Up x Vo, and all v € Ey with |v| <e. 


Proof. Choose a coordinate chart ~: V > R” on an open set V C N con- 
taining go. Choose an open neighborhood U C M of po such that (p,0) € W 
and f(p,0) € V for all p € U, there is a coordinate chart ¢: U + QC R™, 
and there is a local trivialization ® : 7~'(U) — U x R” as in Theorem 2.6.10 
with |®,(v)| = |v| for p € U and v € E,. Define B, := {€ € R"| |E| <r} and 
choose r > 0 so small that ®-!(U x B,) CW and fo® (U x B,) CV. 
Define the map fF’: Q x R” x B, > R” by 


F(z,y,€):= yo fo (¢"(a),g) -y 
for (x,y) €Qx R” and €€ B,. Let xo := (po) and yo := (qo). Then 
we have F'(29, yo,0) = O and the derivative d3F'(x9, yo, 0) : R" > R” of F 
with respect to € at (xo, yo,0) is bijective. Hence Corollary A.2.6 asserts 


that there exist open neighborhoods Up C U of po and Vo C V of qo, a con- 
stant 0 <é <r, and asmooth map g: (Up) x (Vo) > Be such that 


F(z,y,€) =0 = g(x,y) =€ 
for all (x,y) € 6(Uo) x w(Vo) and all € € B-. Thus the map 
h:UoxVo>R‘, — h(p,q) = 5 *(9(9(), ¥(Q))), 


satisfies the requirements of Theorem 2.6.15. 
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2.7 The Theorem of Frobenius 


Let M Cc R* be an m-dimensional manifold and n be a nonnegative integer. 
A subbundle of rank n of the tangent bundle TM is a subset EF C TM 
that is itself a vector bundle of rank n over M, i.e. it is a submanifold 
of TM and the fiber Ep = {v € T,M | (p,v) € E} is an n-dimensional linear 
subspace of T,.M for every p € M. Note that the rank n of a subbundle 
is necessarily less than or equal to m. In the literature a subbundle of the 
tangent bundle is sometimes called a distribution on M. We shall, however, 
not use this terminology in order to avoid confusion with the concept of a 
distribution in the functional analytic setting. 


Definition 2.7.1. Let M C R* be an m-dimensional manifold and E C TM 
be a subbundle of rank n. The subbundle E is called involutive if, for any 
two vector fields X,Y € Vect(M), we have 


X(p),Y(p) € Ey Vp eM = [X,Y](p) © EpVpe M. (2.7.1) 


The subundle E is called integrable if, for every po © M, there exists a 
submanifold N C M such that po € N and T,N = Ey, for every p € N. 
A foliation box for E (see Figure 2.13) is a coordinate chart ¢: U > Q 
on an M-open subset U C M with values in an open set Q C R” x R™” 
such that the set QM (R" x {y}) is connected for every y € R™” and, for 
every p€ U and every v € T,M, we have 


v€ Ep — dd(p)v € R” x {0}. 


(ee 


Figure 2.13: A foliation box. 


Theorem 2.7.2 (Frobenius). Let M C R* be an m-dimensional manifold 
and EC TM be a subbundle of rank n. Then the following are equivalent. 


(i) E is involutive. 
(ii) E is integrable. 
(iii) For every po € M there exists a foliation bor 6: U 4 Q with po € U. 


Proof. See page 81. 
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It is easy to show that (iii) => (ii) => (i) (see below). The hard part of 
the theorem is to prove that (i) ==> (iii). This requires the following lemma. 


Lemma 2.7.3. Let E C TM be an involutive subbundle and X € Vect(M) 
be a complete vector field such that X(p) € Ep for every p€ M. Denote by 


R > Diff(M): tH ¢ 
the flow of X. Then, for allt € R and allp € M, we have 


dg' (p) Ep = Egtpy: (2.7.2) 


Proof. See page 83. 


Lemma 2.7.8 implies Theorem 2.7.2. We prove first that (iii) implies (ii). 
Let po € M, choose a foliation box ¢: U > 2 for E with po € U, and define 


N := (p€ U| d(p) € R” x {yo}} 


where (20, yo) := $(po) € 2. Then N satisfies the requirements of (ii). 

We prove that (ii) implies (i). Choose two vector fields X,Y € Vect(M) 
that satisfy X(p),Y(p) € Ep for all p € M and fix a point pp € M. Then, 
by (ii), there exists a submanifold N C M containing po such that T,N = Ep 
for every p€ N. Hence the restrictions X|y and Y|y are vector fields 
on N and so is the restriction of the Lie bracket [X,Y] to N. Thus we 
have [X,Y ](po) € Ip) N = Ep, as claimed. 

We prove that (i) implies (iii). Thus we assume that EF is an involutive 
subbundle of 7’M and fix a point pp € M. By Theorem 2.6.10 there exist 
vector fields X1,...,Xp € Vect(M) such that X;(p) € E> for all i and p and 
the vectors X1(po),.-.,Xn(po) form a basis of E,,. Using Theorem 2.6.10 
again we find vector fields Yj,...,¥m—n € Vect(/) such that the vectors 


Xi (po); oe ,Xn(po), Yi(po), nies :Yan-nPo) 


form a basis of T,,M. Using cutoff functions as in the proof of Theo- 
rem 2.6.10 we may assume without loss of generality that the vector fields X; 
and Y; have compact support and hence are complete (see Exercise 2.4.13). 
Denote by ¢{,...,¢%, the flows of the vector fields X1,..., Xn, respectively, 
and by wi,...,W,-n the flows of the vector fields Y1,..., Yin—n- Define the 
map 


w:R?xR™" 5M 
by 
(x,y) = df o--- 0 Gan owt? o--- 0 Pe"? (po). 


82 CHAPTER 2. FOUNDATIONS 
By Lemma 2.7.3, this map satisfies 


0 


vy 
for all x € R” and y € R”™~”. Moreover, 


O 
50 0) = X;(po), 


Ow 
—— (0,0) = Y;(po), 
Fy, (000) = Yil0o) 
and so the derivative 

dw(0,0) : R” x R”™”" > T,,M 


is bijective. Hence, by the Inverse Function Theorem 2.2.17, there exists an 
open neighborhood 2 Cc R” x R™~" of the origin such that the set 


U :=v(Q) Cc M 
is an M-open neighborhood of po and wlg:2-—U is a diffeomorphism. 


Thus the vectors Ow /0Ox;(x, y) are linearly independent for every (x,y) € Q 
and, by (2.7.3), form a basis of Eyz,y). Hence 


o:= (d\o)—4 :U 3Q 


is a foliation box and this proves Theorem 2.7.2, assuming Lemma 2.7.3. 


To complete the proof of the Frobenius theorem it remains to prove 
Lemma 2.7.3. This requires the following result. 


Lemma 2.7.4. Let EC TM be an involutive subbundle. If 8 : R? > M is 
a smooth map such that 


OB OB 
5 6879) € F6(s,0)> By 688) € East) (2.7.4) 
for all s,t € R, then 
op 
5 6 t) € Fg(6.2); (2.7.5) 


for all s,t ER. 


Proof. See page 83. 
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Lemma 2.7.4 implies Lemma 2.7.3. Let X € Vect(W) be a complete vector 
field satisfying X(p) € Ep for every p € M and let ¢! be the flow of X. 
Choose a point po € M and a vector v9 € Ep,. By Theorem 2.6.10 there is a 
vector field Y € Vect(M/) with values in F such that Y (po) = vo. Moreover 
this vector field may be chosen to have compact support and hence it is 
complete (see Exercise 2.4.13). Thus there is a solution y : R > M of the 
initial value problem 


Define 6 : R? > M by 


for s,t € R. Then 


28 (s,0) = ¥(s) = Y(7(s)) € Egrs); 
28 (s,t) = X(B(s,t)) € Bacay 


for all s,t € R. Hence it follows from Lemma 2.7.4 that 


a6" (po)v0 = do"(4(0))4(0) = 20,4) € Eyrin 


for every t € R. This proves Lemma 2.7.3, assuming Lemma 2.7.4. 


Proof of Lemma 2.7.4. Given any point po € M we choose a coordinate 
chart @ : U > Q, defined on an M-open set U C M with values in an 
open set 2 C R” x R™™”, such that po € U and dé(po) Ep, = R” x {0}. 
Shrinking U, if necessary, we find that for every p€U the linear sub- 
space d¢(p)E, C R" x R™” is the graph of a matrix A € RO-)x2 Thus 
there exists a smooth map A: 2 > R(™-”™*” such that, for every p € U, 


do(p) Ey = {(§, A(z, WE) |EER"}, (wy) = O(p) EQ. (2.7.6) 
For (x,y) € 2 define the linear maps 
OA 


Fy (tr) -R" > Ress a y) /R™" 5 Rim—n)xn 
ax 


by 
OA “0A OA < O@A 


for € = (&,...,€)) € R" and 7 = (m,.-.-,%m—-n) € R™ ”. We prove the 
following. 
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Claim 1. Let (x,y) € Q, €,€’ € R” and define n, 7! € R™™” by n := A(z, y)E 
and 1 := A(x, y)é’. Then 
OA OA , (0A , OA 
(tow 6+ Few -n) = (Few 6+ Zev) 


The graphs of the matrices A(z) determine a subbundle E Cc Q x R™ with 
the fibers 


E,:= {(€n) €R" x R™"|n = A(w,y)g} 
for z = (x,y) € Q. This subbundle is the image of the restriction 
Elu = {(p, v) |p € U, VE Ep} 


under the diffeomorphism TM|y > Q x R™ : (p,v) © (¢(p), dé(p)v) and 
hence it is involutive. Now fix two elements €,€’ € R” and define the vector 
fields ¢,¢: Q 4 R™ by 


Cz) := (A(z), C2) =(€,4AZ)E), 2 € 2. 
Then ¢ and ¢’ are sections of E and their Lie bracket [¢, ¢’] is given by 
[¢, ’](z) = (0, (4A(z)¢"(z)) € — (4A(z)¢(z)) €') . 


Since E is involutive the Lie bracket [¢,¢’] must take values in the graph 
of A. Hence the right hand side vanishes and this proves Claim 1. 


Claim 2. Let I,J C R be open intervals and let z = (x,y): I x J > Q be 
a smooth map. Fix two points s5 € I and tp € J and assume that 


PH (so, to) = A(2®($0st0), (so, 0)) = (so, 0), (27-7) 
oH (s,1) = A(2(s,1),u(s,t)) S2(s,0) (2.7.8) 


for alls EI andte J. Then 


FY (50,2) = A(2($0.4),¥(s0,8)) 9" (s0,2) (2.7.9) 


for allt € J. 
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Equation (2.7.9) holds by assumption for t = tg. Moreover, dropping the 


argument z(so,t) = z = (a, y) for notational convenience we obtain 


a) & As) _ Oy aoe & Ox OA se ) Ox 


at \ as as dsot ~ Osadt dx Ot dy At) Os 
_ Py A Or x & Ox OA (43) Ox 


Osot Osot Ox Ot Oy Ot Os 
_ O7y A Or x OA Ox OA Ave Ox 
~ Osdt Osot Ox Os Oy Os ot 


Oy A Or x OA Ox OA Oy\ Ox 
OsOt Osot Ox Os Oy Os) Ot 

OA (Oy Ox Ox 

. A 
: & (3 =) at 
_ OA Oy Ave Ox 
Oy Os Os Ot 
Here the second step follows from (2.7.8), the third step follows from Claim 1, 
and the last step follows by differentiating equation (2.7.8) with respect to s. 
Define the curve 7: J > R'~” by 
Oy Ox 
n(t) = 5 (504) = A(x(s0, ), y(s0,t)) 5 (80. ). 
By (2.7.7) and what we have just proved, the curve 77 satisfies the linear 
differential equation 
OA 0 
Al = (F(elo0.2),v660.0) -n(®)) FECs0:2), lta) =o. 

Hence 7(t) = 0 for all t € J. This proves (2.7.9) and Claim 2. 

Now let 8 : R? + M be a smooth map satisfying (2.7.4) and fix a real 
number s9. Consider the set W := {t € R| 0;6(so,t) € Eas)t)}- By going 
to local coordinates, we obtain from Claim 2 that W is open. Moreover, W 


is obviously closed, and W # @ because 0 € W by (2.7.4). Hence W =R. 
Since sg € R was chosen arbitrarily, this proves (2.7.5) and Lemma 2.7.4. 


Any subbundle EF C TM determines an equivalence relation on M via 
there is a smooth curve y: [0,1] ~ M 
such that (0) = po, y(1) = pi, Y(t) € Ey) Vt 


If E is integrable, this equivalence relation is called a foliation and the 
equivalence class of po € M is called the leaf of the foliation through pg. 
The next example shows that the leaves do not need to be submanifolds 


pow Pp <= (2.7.10) 
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Example 2.7.5. Consider the torus M := $! x S$! Cc C? with the tangent 
bundle 


TM = { (21, 22, i121, iA2z2) €C*| fer] = ze] =1, Ar, A2 € R}. 
Let w1,w2 be real numbers and consider the subbundle 
E:= { (21, 22, itwy 21, itw2z2) E C4 | |z1| = |za] =1,¢¢€ R}. 
The leaf of this subbundle through z = (21, 22) € T? is given by 


L= Gage ella») ¢ E R} . 


It is a submanifold if and only if the quotient w;/w2 is a rational number 
(or w2 = 0). Otherwise each leaf is a dense subset of T?. 


Exercise 2.7.6. Prove that (2.7.10) defines an equivalence relation for every 
subbundle EF c TM. 


Exercise 2.7.7. Each subbundle EF C TM of rank 1 is integrable. 


Exercise 2.7.8. Consider the manifold M = R®. Prove that the sub- 
bundle E Cc TM = R® x R® with fiber E, = {(€,7,¢) € R®|¢ — yé = 0} 
over p = (x,y,z) € R® is not integrable and that any two points in R® can 
be joined by a path tangent to E. 


Exercise 2.7.9. Consider the manifold M = S° c R* = C? and define 
PSIG QeCxc’| HL cCLlaicie}. 


Thus the fiber 
bc LhS Ss 


is the maximal complex linear subspace of T,S°. Prove that E has real 
rank 2 and is not integrable. 


Exercise 2.7.10. Let & C TM be an involutive subbundle of rank n and 
let L C M be a leaf of the foliation determined by &. A subset V C L is 
called L-open iff it can be written as a union of submanifolds N of M 
with tangent spaces T,N = E, for p¢ N. Prove that the L-open sets form 
a topology on L (called the intrinsic topology). Prove that the obvious 
inclusion « : L + M is continuous with respect to the intrinsic topology on L. 
Prove that the inclusion 1: L + M is proper if and only if the intrinsic 
topology on L agrees with the relative topology inherited from M (called 
the extrinsic topology). 


Remark 2.7.11. It is surprisingly difficult to prove that each closed leaf D 
of a foliation is a submanifold of M. A proof due to David Epstein [19] is 
sketched in §2.9.4 below. 
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2.8 The Intrinsic Definition of a Manifold* 


It is somewhat restrictive to only consider manifolds that are embedded in 
some Euclidean space. Although we shall see that (at least) every compact 
manifold admits an embedding into a Euclidean space, such an embedding is 
in many cases not a natural part of the structure of a manifold. In particular, 
we encounter manifolds that are described as quotient spaces and there are 
manifolds that are embedded in certain infinite-dimensional Hilbert spaces. 
For this reason it is convenient, at this point, to introduce a more general 
intrinisc definition of a manifold. (See Chapter 1 for an overview.) This 
requires some background from point set topology that is not covered in 
the first year analysis courses. We shall then see that all the definitions and 
results of this chapter carry over in a natural manner to the intrinsic setting. 
We begin by recalling the intrinsing definition of a smooth manifold in 81.4. 


2.8.1 Definition and Examples 


Figure 2.14: Coordinate charts and transition maps. 


Definition 2.8.1 (Smooth m-manifold). Let m € No and M be a set. 
A chart on M is a pair (¢,U) where U C M and ¢ is a bijection from U 
to an open set 6(U) CR™. Two charts (¢1,U1), (¢2,U2) are called com- 
patible iff ¢;(U; A U2) and ¢2(U; 7 U2) are open and the transition map 


oa = d20 ob," : 1(U1 N U2) — $2(U1,N U2) (2.8.1) 


is a diffeomorphism. A smooth atlas on M is a collection & of charts 
on M any two of which are compatible and such that the sets U, as (¢,U) 
ranges over &, cover M (i.e. for every p € M there is a chart (¢,U) € & 
with p € U). A maximal smooth atlas is an atlas which contains every 
chart which is compatible with each of its members. A smooth m-manifold 
is a pair consisting of a set M and a maximal atlas & on M. 
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In Lemma 1.4.3 it was shown that, if @ is an atlas, then so is the 
collection & of all charts compatible with each member of <. Moreover, 
the atlas & is maximal, so every atlas extends uniquely to a maximal atlas. 
For this reason, a manifold is usually specified by giving its underlying set 
and some atlas on M. Generally, the notation for the atlas is suppressed and 
the manifold is denoted simply by M. The members of the atlas are called 
coordinate charts or simply charts on M. By Lemma 1.3.3 a smooth 
m-manifold admits a unique topology such that, for each chart (¢, U) of the 
smooth atlas, the set U C M is open and the bijection 


¢:U > ¢(U) 


is a homeomorphism onto the open set ¢(U) C R™. This topology is called 
the intrinsic topology of M and is described in the following definition. 


Definition 2.8.2. Let M be a smooth m-manifold. The intrinsic topology 
on the set M is the topology induced by the charts, t.e. a subset 


WcM 


is open in the intrinsic topology iff 6(U NW) is an open subset of R™ for 
every chart (¢,U) on Ms 


Remark 2.8.3. Let M Cc R* be smooth m-dimensional submanifold of R* 
as in Definition 2.1.3. Then the set of all diffeomorphisms (¢,U ™ M) as 
in Definition 2.1.3 form a smooth atlas as in Definition 2.8.1. The intrin- 
sic topology on the resulting smooth manifold is the same as the relative 
topology defined in 81.3. 


Remark 2.8.4. A topological manifold is a topological space such that 
each point has a neighborhood U homeomorphic to an open subset of R”. 
Thus a smooth manifold (with the intrinsic topology) is a topological man- 
ifold and its maximal smooth atlas & is a subset of the set .% of all 
pairs (¢,U) where U C M is an open set and ¢ is a homeomorphism from U 
to an open subset of R™. One says that the maximal smooth atlas & is a 
smooth structure on the topological manifold M iff the topology of M is 
the intrinsic topology of the smooth structure and every chart of the smooth 
structure is a homeomorphism. As explained in §1.4 a topological manifold 
can have many distinct smooth structures (see Remark 1.4.6). However, it 
is a deep theorem beyond the scope of this book that there are topological 
manifolds which do not admit any smooth structure. 


‘At this point we do not assume that the intrinsic topology on the manifold M is 
Hausdorff or second countable. These hypotheses will be imposed after the end of the 
present chapter. For explanations see page 91 and page 119. 


2.8. THE INTRINSIC DEFINITION OF A MANIFOLD* 89 


Example 2.8.5. The complex projective space CP” is the set 
CP? = {é Cc Ct! |@ is a 1-dimensional complex subspace } 
of complex lines in C”*!. It can be identified with the quotient space 
cP" = (C™\ {0}) /C’ 


of nonzero vectors in C”*+! modulo the action of the multiplicative group 


C* = C \ {0} of nonzero complex numbers. The equivalence class of a 
nonzero vector z = (z0,.--,2n) € C”*! will be denoted by 
[2] = Zo: Zit t | = f{Az|Ae|eC*} 
and the associated line is 2 = Cz. An atlas on CP” is given by the open 
cover Uj; := {[z0:---: 2n]| 2 40} for 7 = 0,1,...,n and the coordinate 
charts ¢; : U; > C” are 
20 Zi-1 Zi+1 En 
5 tea — wid ee : 2.8.2 
6i(leo soo and) = (2... EY.) (2.8.2) 


Exercise: Prove that each ¢; is a homeomorphism and the transition maps 
are holomorphic. Prove that the manifold topology is the quotient topology, 
ie. if t : C"* \ {0} + CP” denotes the obvious projection, then a sub- 
set U C CP” is open if and only if 7~!(U) is an open subset of C+! \ {0}. 


Example 2.8.6. The real projective space RP” is the set 
RP” = {¢ Cc R"*!| 2 is a 1-dimensional linear subspace} 
of real lines in R”*!. It can again be identified with the quotient space 
RP” = (R"*? \ {0}) /R* 


of nonzero vectors in R"+! modulo the action of the multiplicative group 
IR* = R\ {0} of nonzero real numbers, and the equivalence class of a nonzero 
vector z = (%0,...,2n) € R"*! will be denoted by 


[a] = [to 2 mi rst ey] = [Az |A eR}. 
An atlas on RP” is given by the open cover 
Uj = {leo s+: an] |i #0} 


and the coordinate charts ¢; : U; + R” are again given by (2.8.2), with z; 
replaced by x;. The arguments in Example 2.8.5 show that these coordinate 
charts form an atlas and the manifold topology is the quotient topology. The 


transition maps are real analytic diffeomorphisms. 
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Example 2.8.7. The real n-torus is the topological space 
Te REZ 


equipped with the quotient topology. Thus two vectors x,y € R” are equiv- 
alent iff their difference x — y € Z” is an integer vector and we denote by 
a: IR” > T” the obvious projection which assigns to each vector x € R” its 
equivalence class 

(ge) 2= lee 2 


Then a set U C T” is open if and only if the set ~!(U) is an open subset 
of R”. An atlas on T” is given by the open cover 


Ug := {[z]|a € R”, |x -—al < 1/2}, 


parametrized by vectors a € R”, and the coordinate charts ¢@q : Ug > R” 
defined by ¢q([z]) := x for « € R” with |x—a| < 1/2. Exercise: Show 
that each transition map for this atlas is a translation by an integer vector. 


Example 2.8.8. Consider the complex Grassmannian 
G,(C”) := {V Cc C" |v is a k-dimensional complex linear subspace} . 


This set can again be described as a quotient space Gz(C") = F,(C")/U(k). 
Here 
Fi (C") = {D Ee C"*k | D*D = 1} 


denotes the set of unitary k-frames in C” and the group U(k) acts on Fy(C”) 
contravariantly by D ++ Dg for g € U(k). The projection 


a: Fe(C") > Gz (C”) 


sends a matrix D € F;,(C”) to its image V := a(D) := imD. A subset 
U Cc Gx(C") is open if and only if 7~'(U) is an open subset of Fy(C”). Given 
a k-dimensional subspace V C C” we can define an open set Uy C G;(C”) as 
the set of all k-dimensional subspaces of C” that can be represented as graphs 
of linear maps from V to V+. This set of graphs can be identified with the 
complex vector space Hom (V, V+) of complex linear maps from V to V+ 
and hence with C(—*)**, This leads to an atlas on G,(C”) with holomorphic 
transition maps and shows that G;(C”) is a manifold of complex dimension 
kn — k?. Exercise: Verify the details of this construction. Find explicit 
formulas for the coordinate charts and their transition maps. Carry this 
over to the real setting. Show that CP” and RP” are special cases. 
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Example 2.8.9 (The real line with two zeros). A topological space M 
is called Hausdorff iff any two points in M can be separated by disjoint 
open neighborhoods. This example shows that a manifold need not be a 
Hausdorff space. Consider the quotient space 


M :=R-x {0,1}/= 


where [x,0] = [x,1] for x £ 0. An atlas on M consists of two coordinate 
charts ¢9 : Up > R and ¢, : U; + R where 


U; := {[z,i]|x2 € R}, (2,4) = 


for 1 = 0,1. Thus M is a 1-manifold. But the topology on M is not 
Hausdorff, because the points [0,0] and [0, 1] cannot be separated by disjoint 
open neighborhoods. 


Example 2.8.10 (A 2-manifold without a countable atlas). Consider 
the vector space X = R x R? with the equivalence relation 


either y, = yo £0, ty + 21y1 = te + Fay 


t = It 
ler, 21, 91] = lea, 2, 99] or 41 = yo = 0, t] = to, V1 = Xo. 


For y 4 0 we have (0,2, y] = [t,z — t/y,y], however, each point (#,0) on 
the z-axis gets replaced by the uncountable set R x {(z,0)}. Our manifold 
is the quotient space M := X/ =. This time we do not use the quotient 
topology but the topology induced by our atlas (see Definition 2.8.2). The 
coordinate charts are parametrized by the reals: for t € R the set U;, Cc M 
and the coordinate chart ¢; : U; > R? are given by 


U; = {[é,2,y]|2,y € R}, x(t, v, y]) = (x,y). 


A subset U Cc M is open, by definition, iff ¢;(UMU;) is an open subset of R? 
for every t € R. With this topology each ¢; is a homeomorphism from U; 
onto R? and M admits a countable dense subset 9 := {[0, x, y]| x,y € Q}. 
However, there is no atlas on M consisting of countably many charts. (Each 
coordinate chart can contain at most countably many of the points [t, 0, 0].) 
The function f : M —> R given by f([t, xz, y]) := t+ xy is smooth and each 
point [t,0,0] is a critical point of f with value t. Thus f has no regular 
value. Exercise: Show that M is a path-connected Hausdorff space. 


In Theorem 2.9.12 we will show that smooth manifolds whose topology is 
Hausdorff and second countable are precisely those that can be embedded in 
Euclidean space. Most authors tacitly assume that manifolds are Hausdorff 
and second countable and so will we after the end of the present chapter. 
However before 82.9.1 there is no need to impose these hypotheses. 


92 CHAPTER 2. FOUNDATIONS 


2.8.2 Smooth Maps and Diffeomorphisms 


Our next goal is to carry over all the definitions from embedded manifolds 
in Euclidean space to the intrinsic setting. 


Definition 2.8.11 (Smooth map). Let 


(M,{(¢a,Ua)}oea);  — (N, { (be, Va) baew) 


be smooth manifolds. A map f : M — N is called smooth iff it is continu- 
ous and the map 


fea = 0g° f0¢b5' : ba(UaNn f~'(Vp)) + ve(Va) (2.8.3) 


is smooth for every a € A and every 8 € B. It is called a diffeomorphism 
iff it is bijective and f and f~' are smooth. The manifolds M and N are 
called diffeomorphic iff there exists a diffeomorphism f : M—> N. 


The reader may check that the notion of a smooth map is independent 
of the atlas used in the definition, that compositions of smooth maps are 
smooth, and that sums and products of smooth maps from M to R are 
smooth. 


Exercise 2.8.12. Let WM be a smooth m-dimensional manifold with an atlas 


A =i(b0,U 0) saca: 


Consider the quotient space 


M = (J {a} x da(Ua)/ ~ 


acA 


where 
def 


(a,2)~(8,y) <> = o3*(x) = $5" (y). 
for a,B € A, x € da(Uq), and y € dg(Ug). Define an atlas on M by 
Ua := {la,z]|2 €d¢a(Ua)}, balla, a]) = 2. 
Prove that M is a smooth m-manifold and that it is diffeomorphic to M. 


Exercise 2.8.13. Prove that CP! is diffeomorphic to S?. Hint: Stereo- 
graphic projection. 
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2.8.3 Tangent Spaces and Derivatives 


In the situation where M is a submanifold of Euclidean space and p € M we 
have defined the tangent space of M at p as the set of all derivatives +(0) of 
smooth curves y: R— M that pass through p = y(0). We cannot do this 
for manifolds in the intrinsic sense, as the derivative of a curve has yet to be 
defined. In fact, the purpose of introducing a tangent space of M is precisely 
to allow us to define what we mean by the derivative of asmooth map. There 
are two approaches. One is to introduce an appropriate equivalence relation 
on the set of curves through p and the other is to use local coordinates. 


Definition 2.8.14. Let M be a manifold with an atlas A = {(¢a,Ua) baea 
and let p € M. Two smooth curves yo,71 : R > M with yo(0) = 71(0) = p 
are called p-equivalent iff for some (and hence every) a € A with p € Us 


we have 
d 


d 
F|_,2°O) = a 


if _, ducal). 


We write Yo ne v1 iff Yo is p-equivalent to y, and denote the equivalence class 
of a smooth curve y: RR M with 7(0) =p by |[y|p. Every such equivalence 
class is called a tangent vector of M atp. The tangent space of M at p 
is the set of equivalence classes 


TpM := {[y|p|y: R > M is smooth and y(0) = p}. (2.8.4) 


Definition 2.8.15. Let M be a manifold with an atlas H = {(ba, Ua) ae 
and letp € M. The <-tangent space of M at p is the quotient space 
TYM := (J {a} xR™/4, (2.8.5) 
peva 


where the union runs over alla € A with p€ Ug and 


(a,6)*(8,n) = — d(dg0dq")(@)E=n, © = da(p): 
The equivalence class is denoted by [a, |, and is called a tangent vector. 


In Definition 2.8.14 it is not immediately obvious that the set T,M 
in (2.8.4) is a vector space. However, the quotient space T,’M in (2.8.5) is 
obviously a vector space of dimension m and there is a natural bijection 


TyM > TM : [yp | 


dalvtt))| (2.8.6) 


or 
dt ‘ 


t=0 


This bijection induces a vector space structure on the set T,M. In other 
words, the set TM in (2.8.4) admits a unique vector space structure such 
that the map T,M —> rf M in (2.8.6) is a vector space isomorphism. 
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Exercise 2.8.16. Verify the phrase “and hence every” in Definition 2.8.14 
and deduce that the map T,M — Te M in (2.8.6) is well defined. Show 
that it is bijective. 


From now on we will use either Definition 2.8.14 or Definition 2.8.15 or 
both, whichever way is most convenient, and drop the superscript &/. 


Definition 2.8.17 (Derivative of a smooth curve). For each smooth 
curve y:R— M with y(0) =p we define the derivative 7(0) € T,M as the 
equivalence class 


10) == ble ® [a 5 


salv(t)) € TM. 
Pp 


Definition 2.8.18 (Derivative of a smooth map). Let f : M— N 
be a smooth map between two smooth manifolds (M,{(¢a,Ua)}aca) and 
(N,{(wWe, Va)}eeR) and letp € M. The derivative of f at p is the map 


df (p) : TpM —> Tsp) N 


t=0 


defined by the formula 


df (p) [yp := [f ° VF@) (2.8.7) 


for each smooth curve y : R > M with y(0) = p. Here we use (2.8.4). 
Under the isomorphism (2.8.6) this corresponds to the linear map 


df (p)la, lp = (8, dfaal@)ély@, = bap), (2.8.8) 


fora € A with p € Ug and B € B with f(p) € Vg, where faq ts given 
by (2.8.3). 


Remark 2.8.19. Think of N = R” as a manifold with a single coordinate 
chart, namely the identity map wg = id: R” > R”. For every q ¢ N = R" 
the tangent space TN is then canonically isomorphic to R” via (2.8.5). 
Thus for every smooth map f : M > R" the derivative of f at pe Misa 
linear map df(p) : T,M — R", and the formula (2.8.8) reads 


df (p)[a,flp =Ad(fogy')(a)—, = galp). 
This formula also applies to maps defined on some open subset of M. In 
particular, with f = ¢dg : Ua > R™ we have 
doa(p)[a, Elp = €- 


Thus the map d¢,(p) : T,M — R™ is the canonical vector space isomor- 
phism determined by a. 


With these definitions the derivative of f at p is a linear map and we have 
the chain rule for the composition of two smooth maps as in Theorem 2.2.14. 
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2.8.4 Submanifolds and Embeddings 


Definition 2.8.20 (Submanifold). Let M be a smooth m-manifold and 
letn € {0,1,...,m}. A subset 


NCM 


is called an n-dimensional submanifold of M if, for every element p € N, 
there exists a local coordinate chart 


o:U 32 


for M, defined on an an open neighborhood U C M of p and with values in 
an open set Q CR” x R™™”, such that 


o(UNN) =0QN (R” x {0}). 
By Theorem 2.1.10 an m-manifold 
McR* 


in the sense of Definition 2.1.3 is a submanifold of R* in the sense of Defi- 
nition 2.8.20. By Theorem 2.3.4 the notion of a submanifold N Cc M ofa 
manifold M C R* in Definition 2.3.1 agrees with the notion of a submanifold 
in Definition 2.8.20. 


Exercise 2.8.21. Let N be a submanifold of M. Show that if M is Haus- 
dorff, so is N, and if M is paracompact, so is N. 


Exercise 2.8.22. Let N be asubmanifold of M and let P be a submanifold 
of N. Prove that P is a submanifold of M. Hint: Use Theorem 2.1.10. 


Exercise 2.8.23. Let N be a submanifold of MM. Prove that there exists an 
open set U C M such that N C U and N is closed in the relative topology 
of U. 


All the theorems we have proved for embedded manifolds and their proofs 
carry over almost word for word to the present setting. For example we have 
the inverse function theorem, the notion of a regular value, the implicit 
function theorem, the notion of an immersion, the notion of an embedding 
as a proper injective immersion, and the fact from Theorem 2.3.4 that a 
subset P C M is asubmanifold if and only if it is the image of an embedding. 
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Example 2.8.24 (Veronese embedding). The map 
CP? > CP®: [zg : 21: 2] Es : a : a : 212 : 2220 : 2021] 


is an embedding. (Exercise: Prove this.) It restricts to an embedding of 
the real projective plane into RP° and also gives rise to embeddings of RP? 
into R* as well as to the Roman surface: an immersion of RP? into R®. (See 
Example 2.1.17.) There are similar embeddings 


CP’? > CPN-1, 00 Nes ("7"). 


for all n and d, defined in terms of monomials of degree d in n+1 variables. 
These are the Veronese embeddings. 


Example 2.8.25 (Pliicker embedding). The Grassmannian G2(R*) of 
2-planes in R* is a smooth 4-manifold and can be expressed as the quotient 
of the space F2(IR*) of orthonormal 2-frames in R* by the orthogonal group 
O(2). (See Example 2.8.8.) Write an orthonormal 2-frame in R* as a matrix 


Zo Yo 

pal |. Pipi. 
T Y2 
23 YB 


Then the map f : G2(R*) > RP°, defined by 
f({D]) := [poi : Poa : Pos : P23 : P31 : Pig), Pig = ViYj — ViVi, 
is an embedding and its image is the quadric 
X := f(Ge(R*)) = {p € RP? | porp23 + porpsi + pospi2 = 0}. 


(Exercise: Prove this.) There are analogous embeddings 


f:C,(R") > RPN-1, 0 ONG= (;): 


for all k and n, defined in terms of the k x k-minors of the (orthonormal) 
frames. These are the Pliicker embeddings. 
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2.8.5 Tangent Bundle and Vector Fields 


Let M be a m-manifold with an atlas & = {(¢q,Ua)}ac4- The tangent 
bundle of M is defined as the disjoint union of the tangent spaces, i.e. 


TM = |) {p} x TM = {(p,v) |p M, ve TM}. 
pEeM 


Denote by 7 : TM — M the projection given by m(p,v) := p. Recall the 
notion of a submersion as a smooth map between smooth manifolds, whose 
derivative is surjective at each point. 


Lemma 2.8.26. The tangent bundle of M is a smooth 2m-manifold with 
coordinate charts 


a : Ua = a" (Ug) > ba(Ua) x R”, a(P,v) = (Ga(p), doa(p)v) : 


The projection t : TM — M is a surjective submersion If M is second 
countable and Hausdorff, so is TM. 


Proof. For each pair a, 8 € A the set 
ga(Ua Ug) = ba(Ua Ug) x R™ 
is open in R™ x R™ and the transition map 
$60 = $80 bq": $a(Ua Us) + $8(Ua Up) 


is given by 7 
Bala, E) = (d6a(2), ddga(x)€) 
for x € ¢qg(Ua MN Ug) and € € R™ where 


dpa = $99 bq 


Thus the transition maps are all diffeomorphisms and so the coordinate 
charts dba define an atlas on TM. The topology on TM is determined by 
this atlas via Definition 2.8.2. If M has a countable atlas, so does TM. The 
remaining assertions are easy exercises. 


Definition 2.8.27. Let M be a smooth m-manifold. A (smooth) vector 
field on M is a collection of tangent vectors X(p) € T,M, one for each 
point p€ M, such that the map M +> TM : p+> (p, X(p)) is smooth. The 
set of all smooth vector fields on M will be denoted by Vect(M). 
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Associated to a vector field is a smooth map M + TM whose composi- 
tion with the projection 7: TM — M is the identity map on M. Strictly 
speaking this map should be denoted by a symbol other than X, for exam- 
ple by X. However, it is convenient at this point, and common practice, 
to slightly abuse notation and denote the map from M to T’M also by X. 
Thus a vector field can be defined as a smooth map 


X:M—->TM 


such that 
qwoX =id:MoM. 


Such a map is also called a section of the tangent bundle. 
Now suppose = {(¢a,Ua)}ge,4 is an atlas on M and X :M—+TM 
is a vector field on M. Then X determines a collection of smooth maps 


Xa: ¢a(Ua) > R™ 
given by 
Xa(z) = dba(p)X(p), p= 65" (2), (2.8.9) 


for © € @a(Uq). We can think of each Xq as a vector field on the open set 
ga(Ua) C R™, representing the vector field X on the coordinate patch Ug. 
These local vector fields Xq satisfy the condition 


Xp(bpa(®)) = dopa) Xa(x) (2.8.10) 


for x € da(Uq_ Ug). This equation can also be expressed in the form 


Xalda(UarUs) = PBa%Blbg(UanUe): (2.8.11) 


Conversely, any collection of smooth maps Xq : ¢a(Ua) + R” satisfy- 
ing (2.8.10) determines a unique vectorfield X on M via (2.8.9). Thus we 


can define the Lie bracket of two vector fields X,Y € Vect(M) by 


[X, Ya() := [Xoa, Yal(x) = dXoa(x)Yo(x) —dYa(x)Xa(x) (2.8.12) 


fora € Aandz € ¢,(Uq). It follows from equation (2.4.18) in Lemma 2.4.21 
that the local vector fields 


IX, Y]a: ¢a(Ua) > R™ 
satisfy (2.8.11) and hence determine a unique vector field [X,Y] on M via 


[X, ¥](p) := dba(p) [Xa Yol(ga(p)),  p € Ua. (2.8.13) 
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Thus the Lie bracket of X and Y is defined on Ug as the pullback of the Lie 
bracket of the vector fields Xq and Yq under the coordinate chart ¢y. With 
this understood all the results in 82.4 about vector fields and flows along with 
their proofs carry over word for word to the intrinsic setting whenever M is a 
Hausdorff space. This includes the existence and uniquess result for integral 
curves in Theorem 2.4.7, the concept of the flow of a vector field in Defini- 
tion 2.4.8 and its properties in Theorem 2.4.9, the notion of completeness 
of a vector field (that the integral curves exist for all time), and the various 
properties of the Lie bracket such as the Jacobi identity (2.4.20), the formu- 
las in Lemma 2.4.18, and the fact that the Lie bracket of two vector fields 
vanishes if and only if the corresponding flows commute (see Lemma 2.4.26). 
One can also carry over the notion of a subbundle E Cc T'M of rank n to 
the intrinsic setting by the condition that E is a smooth submanifold of TM 
and intersects each fiber T,,M in an n-dimensional linear subspace 


Ey := {v € TM | (p,v) € E}. 


Then the characterization of subbundles in Theorem 2.6.10 and the theorem 
of Frobenius 2.7.2 including their proofs also carry over to the intrinsic 
setting whenever M is a Hausdorff space. 


2.8.6 Coordinate Notation 


Fix a coordinate chart ¢q : Ua > R™ on an m-manifold M. The components 
of dg are smooth real valued functions on the open subset U, of M and it 
is customary to denote them by 


fica se SR. 
The derivatives of these functions at p € U,, are linear functionals 
dx'(p):T,M—>R, i=1,...,m. (2.8.14) 
They form a basis of the dual space 
TM := Hom(T,M,R). 


(A coordinate chart on M can in fact be characterized as an m-tuple of real 
valued functions on an open subset of M whose derivatives are everywhere 
linearly independent and which, taken together, form an injective map.) 
The dual basis of T,.M will be denoted by 


0 0 


qi P)> — an € TM. (2.8.15) 
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Thus 9 
in) 2 ny _ ot. Jl fi=s, 
as! (v)zaa(0) = 8} = | 0, fi Xi), 
for i,j = 1,...,m and 0/0z’ is a vector field on the coordinate patch Ug. 
For each p € Ug it is the canonical basis of T,,M determined by ¢q. In the 
notation of (2.8.5) and Remark 2.8.19 we have 


O 
art (p) = [a, Caly = doo(p)~ ei, 


where e; = (0,...,0,1,0,...,0) (with 1 in the 7th place) denotes the stan- 
dard basis vector of R™. In other words, for all € = (€1,...,€”) € R™ and 
all p € Ug, the tangent vector 


v:= dba(p) 1 € TpM 


is given by 
v=[a€lp= DE asl). (2.8.16) 
i=1 


Thus the restriction of a vector field X € Vect(M) to Ug has the form 


Xlu, = ee 
where €',...,€ : Ug — R are smooth real valued functions. If the map 
Xa: ¢a(Ua) > R™ 
is defined by (2.8.9), then 
Mei, S16 cts 


The above notation is motivated by the observation that the derivative of 
a smooth function f : M — R in the direction of a vector field X on a 
coordinate patch U,q is given by 


bla Ye: 
re Ox 


Here the term Of/0x* is understood as first writing f as a function of 
x!,...,2, then taking the partial derivative, and afterwards expressing this 
partial derivative again as a function of p. Thus Of /0z* is the shorthand 
notation for the function (2 (f° ¢3")) 0 da: Ua > R. 
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2.9 Consequences of Paracompactness* 


In geometry it is often necessary to turn a construction in local coordinates 
into a global geometric object. A key technical tool for such “local to global” 
constructions is an existence theorem for partitions of unity. 


2.9.1 Paracompactness 


The existence of a countable atlas is of fundamental importance for almost 
everything we will prove about manifolds. The next two remarks describe 
several equivalent conditions. 


Remark 2.9.1. Let M be a smooth manifold and denote by 


Ua 


the topology induced by the atlas as in Definition 2.8.2. Then the following 
are equivalent. 


(a) M admits a countable atlas. 

(b) M is o-compact, i.e. there is a sequence of compact subsets kK; C M 
such that K; C int(K;41) for every i € N and M = U2, Ki. 

(c) Every open cover of M has a countable subcover. 

(d) M is second countable, i.e. there is a countable collection of open sets 


BC W such that every open set U € Y is a union of open sets from the 
collection &. (Z is then called a countable base for the topology of M.) 


That (a) (b) (c) (a) and (a) => (d) follows directly from the 
definitions. The proof that (d) implies (a) requires the construction of a 
countable refinement and the axiom of choice. (A refinement of an open 
cover {U;}ie7 is an open cover {Vj}je7 such that each set V; is contained in 
one of the sets U;.) 


Remark 2.9.2. Let M and Y be as in Remark 2.9.1 and suppose in ad- 
dition that M is a connected Hausdorff space. Then the existence of a 
countable atlas is also equivalent to each of the following conditions. 


(e) M is metrizable, i.e. there is a distance function d: M x M — [0, 00) 
such that WY is the topology induced by d. 

(f) M is paracompact, i.e. every open cover of M has a locally finite 
refinement. (An open cover {Vj;}je,7 is called locally finite iff every p € M 
has a neighborhood that intersects only finitely many V;.) 
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That (a) implies (e) follows from the Urysohn Metrization Theorem 
which asserts (in its original form) that every normal second countable topo- 
logical space is metrizable [51, Theorem 34.1]. A topological space M is 
called normal iff points are closed and, for any two disjoint closed sets 
A,B C M, there are disjoint open sets U,V C M such that A Cc U and 
BC Y. It is called regular iff points are closed and, for every closed set 
AC M and every b € M \ A, there are disjoint open sets U,V Cc M such 
that A Cc U and b € V. It is called locally compact iff, for every open 
set U C M and every p € U, there is a compact neighborhood of p con- 
tained in U. It is easy to show that every manifold is locally compact and 
every locally compact Hausdorff space is regular. Tychonoff’s Lemma 
asserts that a regular topological space with a countable base is normal [51, 
Theorem 32.1]. Hence it follows from the Urysohn Metrization Theorem 
that every Hausdorff manifold with a countable base is metrizable. That (e) 
implies (f) follows from a more general theorem which asserts that every 
metric space is paracompact (see [51, Theorem 41.4] and [62]). Conversely, 
the Smirnov Metrization Theorem asserts that a paracompact Haus- 
dorff space is metrizable if and only if it is locally metrizable, i.e. every 
point has a metrizable neighborhood (see [51, Theorem 42.1]). Since ev- 
ery manifold is locally metrizable this shows that (f) implies (e). Thus we 
have (a) => (e) <= (f) for every Hausdorff manifold. 

The proof that (f) implies (a) does not require the Hausdorff property 
but we do need the assumption that M is connected. (A manifold with 
uncountably many connected components, each of which is paracompact, is 
itself paracompact but does not admit a countable atlas.) Here is a sketch. 
If M is a paracompact manifold, then there is a locally finite open cover 
{Ua}aea by coordinate charts. Since each set U, has a countable dense 
subset, the set {a € A| Ug Ua, 4 0} is at most countable for each ag € A. 
Since M is connected we can reach each point from U,, through a finite 
sequence of sets Ug,,...,Ua, with Ug,_, Ua, #0. This implies that the 
index set A is countable and hence M admits a countable atlas. 


Remark 2.9.3. A Riemann surface is a 1-dimensional complex manifold 
(i.e. the coordinate charts take values in C and the transition maps are 
holomorphic) with a Hausdorff topology. It is a deep theorem in the theory 
of Riemann surfaces that every connected Riemann surface is necessarily 
second countable (see [2]). Thus pathological examples of the type discussed 
in Example 2.8.10 cannot be constructed with holomorphic transition maps. 
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Exercise 2.9.4. Prove that every manifold is locally compact. Find an ex- 
ample of a manifold M and a point po € M such that every closed neighbor- 
hood of po is non-compact. Hint: The example is necessarly non-Hausdorff. 


Exercise 2.9.5. Prove that a manifold M admits a countable atlas if and 
only if it is o-compact if and only if every open cover of M has a countable 
subcover if and only if it is second countable. Hint: The topology of R™ is 
second countable and every open subset of R™ is o-compact. 


Exercise 2.9.6. Prove that every submanifold M C R* (Definition 2.1.3) 
is second countable. 


Exercise 2.9.7. Prove that every connected component of a manifold M is 
an open subset of M and is path-connected. 


2.9.2 Partitions of Unity 


Definition 2.9.8 (Partition of unity). Let M be a smooth manifold. A 
partition of unity on M is a collection of smooth functions 


64: M = [0,1], aeéA, 


such that each point p © M has an open neighborhood V C M on which only 
finitely many 0g do not vanish, i.e. 


# {ae A] Oalv £0} < c, (2.9.1) 


and, for every p€ M, we have 


sa) =1. (2.9.2) 


acA 


Tf {Uatgea 18 an open cover of M, then a partition of unity {Oa}ge4 (indexed 
by the same set A) is called subordinate to the cover iff each 0, is 
supported in Ug, 1.e. 


supp(9a) = {p € M | @a(p) #0} C Ua. 


Theorem 2.9.9 (Partitions of unity). Let M be a smooth manifold whose 
topology is paracompact and Hausdorff. Then, for every open cover of M, 
there exists a partition of unity subordinate to that cover. 


Proof. See page 105. 
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Lemma 2.9.10. Let M be a smooth manifold with a Hausdorff topology. 
Then, for every open set V C M and every compact set K C V, there exists 
a smooth function &: M — [0,00) with compact support such that K(p) > 0 
for every p€ K and supp(k) CV. 


Proof. Assume first that K = {po} is a single point. Since M is a mani- 
fold it is locally compact. Hence there is a compact neighborhood C' Cc V 
of po. Since M is Hausdorff C is closed and hence the set U := int(C) 
is a neighborhood of po whose closure U C C is compact and contained 
in V. Shrinking U, if necessary, we may assume that there is a coordinate 
chart ¢: U + with values in some open neighborhood 2 Cc R™ of the 
origin such that (po) = 0. (Here m is the dimension of /.) Now choose a 
smooth function Kg : Q — [0,co) with compact support such that %9(0) > 0. 
Then the function «: M — [0,1], defined by «ly := ko 0 ¢ and K(p) :=0 
for p € M \U is supported in V and satisfies K(pp) > 0. This proves the 
lemma in the case where K is a point. 

Now let K be any compact subset of V. Then, by the first part of 
the proof, there is a collection of smooth functions kK, : M — [0,00), 
one for every p € K, such that K,(p) > 0 and supp(Kp) C V. Since 
is compact there are finitely many points p1,...,p, © K such that the 
sets {p € M| kp,(p) > 0} cover K. Hence the function « := dU; Kp; 18 sup- 
ported in V and is everywhere positive on K. This proves Lemma 2.9.10. 


Lemma 2.9.11. Let M be a topological space. If {Vi};c, ts a locally finite 
collection of open sets in M, then 


Uv-uUr. 


iE Ig i€Io 


for every subset Ip C I. 


Proof. The set Uj 1o Vi 18 obviously contained in the closure of Use Ip Vi: 
To prove the converse choose a point po € M \ Uie a V;. Since the collec- 
tion {V;}ier is locally finite, there exists an open neighborhood U of po such 


that the set ) := {i € 1|VinU FD} is finite. Hence the set 


Uo — U \ U V; 
t€1I9Nh 


is an open neighborhood of po and we have Uj) MV; = @ for every i € Ip. 
Hence po ¢ Ujer, Vi. This proves Lemma 2.9.11. 


2.9. CONSEQUENCES OF PARACOMPACTNESS* 105 


Proof of Theorem 2.9.9. Let {Ua} je, be an open cover of M. We prove in 
four steps that there is a partition of unity subordinate to this cover. The 
proofs of steps one and two are taken from [51, Lemma 41.6]. 


Step 1. There is a locally finite open cover {Vi},-, of M such that, for 
every i € I, the closure V; is compact and contained in one of the sets Ug. 


Denote by VY Cc 2™ the set of all open sets V C M such that V is compact 
and V Cc Ug, for some a € A. Since M is a locally compact Hausdorff 
space the collection Y is an open cover of M. (If p € M, then there is an 
a € A such that p € U,; since M is locally compact there is a compact 
neighborhood K C Us, of p; since M is Hausdorff K is closed and thus 
V := int(K) is an open neighborhood of p with V C K C U4.) Since M is 
paracompact the open cover VY has a locally finite refinement {V;}ie7. This 
cover satisfies the requirements of Step 1. 


Step 2. There is a collection of compact sets K; C V;, one for eachi € I, 
such that M = Use, Ki. 


Denote by W C 2™ the set of all open sets W C M such that W C V; for 
some 2. Since M is a locally compact Hausdorff space, the collection W is an 
open cover of M. Since M is paracompact W has a locally finite refinement 
{Wj}je7- By the axiom of choice there is a map 


Ja D5 4 tay 

such that 
W,;cV, Vied. 
Since the collection {W;}je7 is locally finite, we have 
Ki:= w= UWicv 

ij=i ij=i 
by Lemma 2.9.11. Since V; is compact so is K;. 
Step 3. There is a partition of unity subordinate to the cover {V;i}ier. 


Choose a collection of compact sets kK; C V; for 7 € J as in Step 2. Then, 
by Lemma 2.9.10 and the axiom of choice, there is a collection of smooth 
functions K; : M — [0,co) with compact support such that 


supp(K;) C Vi, Kilk, > 0 Viel. 
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Since the cover {V;}iez is locally finite the sum 


k= So ni:M3R 
tel 


is locally finite (i.e. each point in M has a neighborhood in which only 
finitely many terms do not vanish) and thus defines a smooth function on M. 
This function is everywhere positive, because each summand is nonnegative 
and, for each p € M, there is ani € I with p € Kj; so that «;(p) > 0. Thus 
the funtions y; := «;/« define a partition of unity satisfying supp(x;) C Vi 
for every 7 € I as required. 


Step 4. There is a partition of unity subordinate to the cover {Uahaea- 


Let {yi}ier be the partition of unity constructed in Step 3. By the axiom 
of choice there is a map I > A:i++ a; such that V; C Ug, for every 7 € I. 
For a € A define 0, : M — [0,1] by 


Oo 1= Ss" i: 


ay=a 


Here the sum runs over all indices i € I with a; = a. This sum is locally 
finite and hence is a smooth function on M. Moreover, each point in M has 
an open neighborhood in which only finitely many of the 0, do not vanish. 
Hence the sum of the 6, is a well defined function on M and 


Y= Vw Dwet 


acA acAay=a wel 


This shows that the functions 0, form a partition of unity. To prove the 
inclusion supp(9a) C Ua we consider the open sets 


Wi := {p © M|xi(p) > OF 


for 2 € I. Since W; C V; this collection is locally finite. Hence, by 
Lemma 2.9.11, we have 


supp(a)= LJ Wi= LJ Wi= LU supp0u) c Ul Vic ta. 


ay=a ay=a ay=a ay=a 


This proves Theorem 2.9.9. 
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2.9.3 Embedding in Euclidean Space 


Theorem 2.9.12. Let M be a second countable smooth m-manifold with a 
Hausdorff topology. Then there exists an embedding f : M — R?™+! with a 
closed image. 


Proof. The proof has five steps. 


Step 1. Let UCM be an open set and let K CU be a compact set. 
Then there exists an integer k €N, a smooth map f : M— R*, and an 
open set V C M, such that K CV CU, the restriction fly : V > R* is an 
injective immersion, and f(p) =0 for allpe M\U. 


Choose a smooth atlas & = {(¢a,Ua)}ae4a on M such that, for each a € A, 
either Ug C U or Un K = @. Since M is a paracompact Hausdorff man- 
ifold, Theorem 2.9.9 asserts that there exists a partition of unity {Oa}aea 
subordinate to the open cover {Ug }aca of M. Since the sets U, with Ua Cc U 
form an open cover of K and K is a compact subset of /, there exist finitely 
many indices aj,...,a¢ € A such that 


AC Ugg Uo Uap =v eu. 
Let k := (m+ 1) and, fori =1,...,2, abbreviate 
Pi = Pas 0; = 9a,;. 
Define the smooth map f : M —> R* by 


61 (p) 
91 (p)d1(p) 
f(p) = : for pe M. 
O¢(p) 
Ae(p)de(p) 


Then the restriction f|y : V > R* is injective. Namely, if pp,p; € V satisfy 


f (po) = f(r), 


then 

i {i | 6; (po) > o} = {i | 6;(p1) > 0} x ) 
and, fori € I, we have 6;(p9) = 9;(p1), hence $;(p0) = ¢i(p1), and so po = pr. 
Moreover, for every p € K the derivative df(p) : Tp M — R* is injective, and 
this proves Step 1. 
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Step 2. Let f : M—> R* be an injective immersion and let Ac R@™+))xk 
be a nonempty open set. Then there exists a matrix A € A such that the 
map Af :M > R?™*+! is an injective immersion. 


The proof of Step 2 uses the Theorem of Sard (see [1, 50]). The sets 


Wo = {(p,.9)€ Mx M|p# aq}, 
Wi := {(p,v) €TM |v #0} 


are open subsets of smooth second countable Hausdorff 2m-manifolds and 
the maps 


Fo: Ax Wo 9 R?™?, Fi: Ax W, 3 R?™1, 
defined by 
Fo(A, p, q) — A(f(p) — f(q)), F\(A, p, v) = Adf (p)v 


for A € A, (p,q) € Wo, and (p,v) € Wi, are smooth. Moreover, the zero 
vector in R?”+! is a regular value of Fp because f is injective and of F, 
because f is an immersion. Hence it follows from the intrinsic analogue of 
Theorem 2.2.19 that the sets 


Mo := Fo *(0) = {(A,p, 4) € Ax Wo| Af(p) = Af(Q)} 5 
My, := F,*(0) = {(A,p,v) € A x W,| Adf(p)v = 0} 


are smooth manifolds of dimension 
dim Mo = dim M,; = (2m+1)k—1. 


Since M is a second countable Hausdorff manifold, so are Mop and Mj. 
Hence the Theorem of Sard asserts that the canonical projections 


Mo oA: (Ay p, q) +> A=: m0(A, p, q); 
M, > A:(A,p,v) + A=: m(A,p,v), 


have a common regular value A € A. Since 
dim Mop = dim M, < dim A, 


this implies 


AE A (170(Mo) U ™1(M1)) 3 


Hence Af : M > R?”*! is an injective immersion and this proves Step 2. 
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If M is compact, the result follows from Steps 1 and 2 with K =U = M. 
In the noncompact case the proof requires two more steps to construct an 
embedding into R“’”** and a further step to reduce the dimension to 2m + 1. 


Step 3. Assume M is not compact. Then there exists a sequence of open 
sets U; C M, a sequence of smooth functions p; : M — [0,1], and a sequence 
of compact sets K; C U; such that 


supp(p;) C Ui, K;= oe () Cc U;, Un; =0 


for all i,j EN with |i-j| > 2 and M =U, Ki. 


Since M is second countable, there exists a sequence of compact sets C; C M 
such that C; C int(Cj+1) for all ic N and M=U,ey Ci (Remark 2.9.1). 
Define the compact sets B; C M by Cp := @ and 


BS C;\ Cia fori E€ N. 
Then M = U,cy Bi and, for all 7,7 ¢ N with j > i+ 2, we have 
B,C G, Cimt(Cjea), Bye C;\imt(C)1) 


and so Bj; .B; =9. Since M is metrizable by Remark 2.9.2, there exists 
a distance function d: M x M — [0,00) that induces the intrinsic topology 
on M. Define 


A= | U - By. a Agk,) =e d(p, q)- 
jEN\{i-1,i,i+1} 


Then A; is a closed subset of I, because any convergent sequence in M must 
belong to a finite union of the B;. Since A; B; = 0, this implies ¢; > 0. 
For 7 € N define the set U; C M by 


Uj := {p eM | there exists a gq € B; with d(p,q) < €i/3} : 


Then {U;};en is a sequence of open subsets of M such that B; C U; C Ci41 
for all i € N and U; NU; = 0 for |i — j| > 2. In particular, each set U; has a 
compact closure. 

For each 7 there exists of a partition of unity subordinate to the open 
cover M = U; U (M \ B;) and hence a smooth function p; : M — [0,1] such 
that supp(pi) C Uj and pip, = 1. Define Kj := p;‘(1) = {p € Ui | pi(p) = 1} 
for i¢ N. Then K; is a compact set and B; C K; C U; for each 1 EN. 
Hence M = Ujen Ki and this proves Step 3. 
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Step 4. Assume M is not compact. Then there exists an embedding 
f:MoR™ 


with a closed image and a pair of orthonormal vectors x,y € R4”** such 
that, for every € > 0, there exists a compact set K C M with 


fp) — sx — ty <€. (2.9.3) 


If(p)| 


sup 
peM\K s,tER 


Assume M is not compact and let K;,U;,p; be as in Step 3. Then, by 
Steps 1 and 2, there exists a sequence of smooth maps g; : M > R?™*! such 
that gilau, =9, the restriction gi|K,: Ki > R?™+! is injective, and the 
derivative dg;(p) : Tp) M — R?™*! is injective for all p € K; and all iE N. 
Let € € R?”*+! be a unit vector and define the maps f; : M > R?”*! by 


AG =p) ee (2.9.4) 
1+ lol? 


for p€ M andiéN. Then the restriction fj|x, : K; > R?™*! is injective, 
the derivative df;(p) : T,M —> R?”*" is injective for all p € K;, and 


supp(fi) C Vi, Fi( Ki) C Bi (48), fi(M) C Bi+1(0). 
Define the maps f°", f°’ : M — R?™+1 and p°44, p°Y : M > R by 
{ px—1(p), ifi © N and p € Uy-1, 
0, if p EM \ Usen Ua;-1, 
fo"(p) := { foi-1(p), if ie N and p € Uj;-1, 
0, if p € M \ Vien Vai-1, 
ev( ) a p2i(p), ifi © N and pE U;, 
BOMBS 06h: ifp € M \ Usen Unis 
f°" (p) — foi(p), if7 € N and pe U;, 
, 0, if p eM \ Vien U2, 


en'(p) = 


p 


and define the map f : M > R**+4 by 


FP) = (9°*4(p), fo"), 0™(P), F*()) 


forpe M. 
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We prove that f is injective. To see this, note that 


21 = 2 < | F°8*(p)| < 24, 
€ Kaji => ay : 
a {iF (P)| <2 +1, ae 
. 2i-—1<|f°(p)| < 264+1, _ 
pce 7 { |f°94(p)| < 2% + 2, 


Now let po,p1 € M such that f(po) = f(p1). Assume first that po € Kaj-1. 
Then p°44(p;) = p°44(p9) = 1 and hence p; € Ujen Koj-1. By (2.9.5), we 
also have 2i — 2 < |f°44(p1)| = | f°44(p9)| < 2i and hence p, € Ko;-1. This 
implies foi—1(p1) = f°“"(p1) = f°" (po) = fai-1(po) and so po = pi. Now 
assume po € Koj. Then p* (pi) = p (po) = 1 and hence py € Ujey K2;- 
By (2.9.5), we also have 2i — 1 < |f°’(p1)| = | f°’ (po)| < 2+ 1, so py € Kay, 
which implies f2i(p1) = f°’ (pi) = f°’ (po) = fei(po), and so again po = pr. 
This shows that f is injective. That f is an immersion follows from the fact 
that the derivative df;(p) is injective for all p € K; and alli € N. 

We prove that f is proper and has a closed image. Let (p,),en be a 
sequence in M such that the sequence (f(p,)),en in R“”*+* is bounded. 
Choose i € N such that |f°¢4(p,)| < 2i and | f°’ (p,)| < 2 +1 for all v EN. 
Then p, € Wea kK; for all v € N by (2.9.5). Hence (p,),en has a convergent 
subsequence. Thus f : M — R*”*+4 is an embedding with a closed image. 

Next consider the pair of orthonormal vectors 


x :=(0,€,0,0),  y:=(0,0,0,€) 


in R4™*4—Rx RY Rx R?™!_ Let (p,)ven be a sequence in M 
that does not have a convergent subsequence and choose a sequence 1, € N 
such that pp € Ko,-1 U Ko;, for all ye N. Then i, tends to infinity. 
If py € Ky;,-1 for all v, then we have limsup,_,,.|f°¢4(p,)|7!| fe" (py)| <1 
by (2.9.5). Passing to a subsequence, still denoted by (py),en, we may 
assume that the limit A := lim, ,..| f°4(f,)| “| f° (pv)| exists. Then 


odd ev 
i2h Zio ie Lfos*(py)| kon | __ __Y 


v-voo |f(py)| VT +2? v0 |f(py)| VT 4?” 
and it follows from (2.9.4) that 


Pe ia 
] SS ee SNS 
y=rce | fo44 (p,) 


f° (pv) 


ue Te ee 
SES TAG 


This implies 


; Lp) = ( g Ag ie : é 
ie Fo \° Vie Via) Vie * Vie 
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Similarly, if p, € Ko; for all v, there exists a subsequence such that the 
limit  := limysoo| f°’ (pv) |7"| f°" (pr)| exists and, by (2.9.4), this implies 


im L(pv) = (0 a 0 ) = : xs : y 
vee | f(py)| VTE OVI EN VIF? OVI 
This shows that the vectors x and y satisfy the requirements of Step 4. 


Step 5. There exists an embedding f : M > R?™*! with a closed image. 


For compact manifolds the result was proved in Steps 1 and 2 and for m = 0 
the assertion is obvious, because then M is a finite or countable set with 
the discrete topology. Thus assume that M is not compact and m > 1. 
Choose f : M —> R*"+4 and 2,y € R*”* as in Step 4 and define 


Av= d AE ROM+H)x(4m-+4) the vectors Ax and Ay 
— are linearly independent 


Since m > 1, this is a nonempty open subset of R2™+1)*(4m+4)_ We prove 
that the map Af: M— R?+! is proper and has a closed image for ev- 
ery A € A. To see this, fix a matrix A € A. Let (p,),en be a sequence in M 
that does not have a convergent subsequence. Then by Step 4 there exists a 
subsequence, still denoted by (p,),en, and real numbers s,t € R such that 


2 2 f (pv) : 
so +t =1, =sx-+ty, lim |f(p,)| = 00. 
vo | f(p,)| a (p.)| 
This implies 
Af (pv) 


im = sAr+tAy 40 
vee | f(pv)| 


and hence limy—oo |Af (py)| = co. Thus the preimage of every compact sub- 
set of R?”+! under the map Af : M— R?+! is a compact subset of M, 
and hence Af is proper and has a closed image (Remark 2.3.3). 

Now it follows from Step 2 that there exists a matrix A € A such that the 
map Af : M —> R?™*! is an injective immersion. Hence it is an embedding 
with a closed image. This proves Step 5 and Theorem 2.9.12. 


The Whitney Embedding Theorem asserts that every second count- 
able Hausdorff m-manifold M admits an embedding f : M— R?”. The 
proof is based on the Whitney Trick and goes beyond the scope of this 
book. The next exercise shows that Whitney’s theorem is sharp. 


Remark 2.9.13. The manifold RP? cannot be embedded into R®. The same 
is true for the Klein bottle K := R*/ = where the equivalence relation is 
given by [x,y] = [ec +k,@—y] forz,ye Rand k,leZ. 
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2.9.4 Leaves of a Foliation 


Let M be an m-dimensional paracompact Hausdorff manifold and EF Cc TM 
be an integrable subbundle of rank n. Let L C M be a closed leaf of the 
foliation determined by EF. Then L is a smooth n-dimensional submanifold 
of M. Here is a sketch of David Epstein’s proof of this fact in [19]. 


(a) The space L with the intrinsic topology admits the structure of a mani- 
fold such that the obvious inclusion 1: L > M is an injective immersion. 
This is an easy exercise. For the definition of the intrinsic topology see 
Exercise 2.7.10. The dimension of L is n. 


(b) If f: X > Y is a continuous map between topological spaces such that Y 
is paracompact and there is an open cover {V;}je7 of Y such that f—'(V;) 
is paracompact for each j, then X is paracompact. To see this, we may 
assume that the cover {V;}je, is locally finite. Now let {Ua}aea be an open 
cover of X. Then the sets U, f~!(V;) define an open cover of f~1(Vj). 
Choose a locally finite refinement {Wi;}icr, of this cover. Then the open 
cover {Wi;}jcu,icr1; Of M is a locally finite refinement of {Ua}aca- 


(c) The intrinsic topology of L is paracompact. This follows from (b) and 
the fact that the intersection of L with every foliation box is paracompact 
in the intrinsic topology. 


(d) The intrinsic topology of L is second countable. This follows from (a) 
and (c) and the fact that every connected paracompact manifold is second 
countable (see Remark 2.9.2). 


(e) The intersection of L with a foliation box consists of at most countably 
many connected components. This follows immediately from (d). 


(f) If L is a closed subset of M, then the intersection of L with a foliation 
box has only finitely many connected components. 'To see this, we choose 
a transverse slice of the foliation at po € LD, i.e. a connected submanifold 
T CM through po, diffeomorphic to an open ball in R™~”, whose tangent 
space at each point p € T is a complement of E,. By (d) we have that TNL 
is at most countable. If this set is not finite, even after shrinking T’, there 
must be a sequence p; € (TNL) \ {po} converging to po. Using the holonomy 
of the leaf (obtained by transporting transverse slices along a curve via a 
lifting argument) we find that every point p € TL is the limit point of a 
sequence in (TL) \ {p}. Hence the one-point set {p} has empty interior in 
the relative topology of TL for each p€ TAL. Thus TL is a countable 
union of closed subsets with empty interior. Since 7M L admits the structure 
of a complete metric space, this contradicts the Baire category theorem. 


(g) It follows immediately from (f) that L is a submanifold of M. 
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2.9.5 Principal Bundles 


An interesting class of foliations arises from smooth Lie group actions. 
Let GC GL(N,R) be a compact Lie group and let P be a smooth m- 
manifold whose topology is Hausdorff and second countable. A smooth 
(contravariant) G-action on P is a smooth map 


PxG—>P:(p,g) > pg (2.9.6) 


that satisfies the conditions 


(pg)h=p(gh), pl=p (2.9.7) 


for all p€ P and all g,h © G. Fix any such group action. Then every 
group element g € G determines a diffeomorphism P > P : p+> pg, whose 
derivative at p € P is denoted by T7,P + Tp,gP:vt+ vg. Every Lie alge- 
bra element € € g := Lie(G) = 7)G determines a vector field X¢ € Vect(P) 
which assigns to each p € P the tangent vector 


d 
Xe¢(p) = p= 7 pexp(t€) € T,P. (2.9.8) 
t=0 
The linear map g — Vect(P) : € > X¢ is called the infinitesimal action. 
It is a Lie algebra anti-homomorphism because the group action is contra- 
variant. (Exercise: Prove that [X¢, X,] = —Xjen) for €,7 € g.) The group 
action (2.9.6) is said to be with finite isotropy iff the isotropy subgroup 


Gp := {g € G| pg = p} 


is finite for all p¢ P. The isotropy subgroup Gp is a Lie subgroup of G 
with Lie algebra g, := Lie(G,) = {€ € g| Xe(p) = 0}. Since G is compact, 
this shows that G, is a finite subgroup of G if and only if g, = {0} 
or, equivalently, the map g > T,P :€ +> X¢(p) = p€ is injective. Thus, in 
the finite isotropy case, the group action determines an involutive subbun- 
dle E C TP with the fibers E, := pg = {Xe(p)|€ € g} for pe P. When G 
is connected, the leaves of the corresponding foliation are the group or- 
bits pG := {pg|g € G}. These are the elements of the orbit space 


P/G := {pG|pe€ P}. 


There is a natural projection 7 : P + P/G defined by z(p) := pG for p € P 
and the orbit space P/G is equipped with the quotient topology (a sub- 
set U C P/G is open if and only if t~!(U) is an open subset of P). The 
group action is called free iff G, = {1} for all pe P. The next theorem 
shows that, in the case of a free action, the quotient space admits a unique 
smooth structure such that the projection 7 : P + P/G is a submersion. 
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Theorem 2.9.14 (Principal Bundle). Let P be a smooth m-manifold 
whose topology is Hausdorff and second countable. Suppose P is equipped 
with a smooth contravariant action of a compact Lie group G and assume 
the group action is free. Then dim(G) < m and B := P/G admits a unique 
smooth structure such that the projection 7: P + B is a submersion. The 
intrinsic topology of B, induced by the smooth structure, agrees with the 
quotient topology, and it is Hausdorff and second countable. 


Proof. For each p € P the map G > P:g pg is an embedding and this 
implies k := dim(G) < dim(P) =m. Define n:=m—k. A local slice of 
the group action is a smooth map z : 2 — P, defined on an open set 2 C R”, 
such that the map 2 x G > P: (a,g) > U(x)g is an embedding. With this 
understood, we prove the assertions in five steps. 


Step 1. For every po € P there exists a local slice t9 : Xo > P, defined on 
an open neighborhood Qo C R” of the origin, such that t9(0) = po. 


Choose a coordinate chart 6: V > R™ on an open neighborhood V Cc P 
of po such that (po) = 0 and ¢(V) = R™. Define v1,...,Um € Tp,P by 


dd(po)vu; := e; 10r ? = 1,600 Mt, 


where €1,...,€m is the standard basis of R™. Reorder the coordinates 
on R”, if necessary, such that the vectors v1,..., Un project to a basis of the 
quotient space T,, P/pog. Define .: R" + P by 


i(x1,...,2n) = @ 1 (a1,-..,2n,0,---,0) 
and define the map wy: R” x G > P by 
W(x, g) = U(x)g for z € R" andg €G. 
Then w is smooth and its derivative dy(0, 1) : R” x g > T,,P is given by 
deh (0, 1)(#,) = S° Bivi + pok 
i=1 


for © = (Z1,...,%n) € R” and € € g. Hence dy(0, 11) is bijective and so it 
follows from the Inverse Function Theorem 2.2.17 that there exist open 
neighborhoods 2 C R” of 0, Q1 C G of 1, and W C P of po such that the 
restricted map 


W1 = vlaxa, +2 x 1 > W 


is a diffeomorphism. 
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Next we prove that there exists an open neigborhood 9 C 2 of the 
origin such that the restricted map 


wWo = WlQoxG : Qo xGoP 


is injective. Suppose otherwise that no such neighborhood Qo exists. Then 
there exist sequences (2;, gi), (z/, 9) € Q x G such that (xi, 9:1) 4 (24, g/) 
and w(x;,9:) = v(4, g/) for all i and the sequences (2%;);en and (2/)ien in Q 
converge to the origin. Since G is compact we may assume, by passing to 
a subsequence if necessary, that the sequences (g;)ien and (g/)ien converge. 
Denote the limits by 


g:= lim g €G, g := lim gi EG. 
1-00 t+00 


Then 
pog = lim. 1(a;)gi = Tim 1(a4) gi = pos! 
and so g = g' because the group action is free. Thus the sequence (gig; 1) en 
in G converges to Il and hence belongs to the set (; for 7 sufficiently large. 
Since 
di (wi, 1) = ofa) = (ah) gig; * = r(x}, gg; *) 

for all 7, this contradicts the injectivity of ~,. Thus we have proved that 
the map Wo : Qo x G > P is injective for a suitable neighborhood Q9 C 2 
of the origin. That it is an immersion is a direct consequence of the formula 


dwo(x, 9)(@, G) = (de(x)& + u(x)(Gg~")) 9g = (dvo(a, 1)(2, G9~")) 9 


for all x € Qo, © € R”, g € G, and g € T,G, and the fact that the deriva- 
tive dwo(x, 11) is bijective for all 2 € Qo (even for all x € 2). 

Thus we have proved that Wo : Qo x G > P is an injective immersion. 
Shrinking Qo further, if necessary, we may assume that 09 has a compact 
closure and that w is injective on Q9 x G. This implies that wo is proper. 
Namely, if (@j,9i);en is a sequence in Q x G and (2,g) € Qo x G such 
that vo(x, 9) = limi 4. Wo(%i, gi), then there is a subsequence (2;,,, gi, ven 
that converges to a pair (a’,g’) € No x G. This subsequence satisfies 


v(2",g') = Jim, Wo(Li,, Gir) = w(x, 9). 


Since ~ is injective on Qo x G, this implies ¢ = 2’ and g = g’. Thus 
every subsequence of (2;,9;)ien has a further subsequence that converges 
to (a,g) and so the sequence (2;, g9;)icn itself converges to (x,g). Thus the 
map Wo : Qo Xx G > Pisa proper injective immersion and this proves Step 1. 
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Step 2. Letv:Q— P be a local slice. Then the set U := 1(t(Q)) C B is 
open in the quotient topology and the map 701: 02 —- U is a homeomorphism 
with respect to the quotient topology on U. 


The map ~ : 2x G > P, defined by ~(z, g) := u(x)g for x € ND and g EG, 
is an embedding. Hence W := 7y(Q x G) is an open G-invariant subset 
of P and w:Qx G-— W is a G-equivariant homeomorphism. Moreover, 
for every element p € P, we have z(p) € U if and only if there exists 
an element « € Q and an element g € G such that p=u(x)g = Y(a, 9). 
Thus 1~!(U) = y(Q x G) = W is an open subset of P, and so U is an open 
subset of B = P/G with respect to the quotient topology. The continuity 
of tor: 92 U follows directly from the definition. Moreover, if 2’ C 2 is 
an open set and U’ := r(u(’)), then t7!(U’) = W(M! x G) is open by the 
same argument, and so U’ C B is open with respect to the quotient topology. 
Thus 7 04:9. U is a homeomorphism and this proves Step 2. 


Step 3. By Step 1 there exists a collection tg: Q. 74 P, a€ A, of local 
slices such that the sets Ug := T(ta(Qa)) cover the orbit space B = P/G. 
Fora€A define 

ba = (101) 12 Ueg > Nes 


Then & = {(ba,Ua)}aea 18 a@ smooth structure on B which renders the 
canonical projection 7: P + B into a submersion. Moreover, this smooth 
structure is compatible with the quotient topology on B. 

For a, 8 € A define Qag := ¢a(Ua M Ug) and bga := $8 ° 61 : Nag > Qga- 
We must prove that ¢gq is smooth. To see this, define Yq: Qa x G— P 
by a(x, g) := la(x)g for ae A, x EN, and g EG. Then wz, is a diffeo- 
morphism onto its image and Woa(Qag x G) = g(Qga x G) = 7 1(Ug NUz). 
For x € Qeg the element dga(x) € Qgaq is the projection of v5! © Wa(x, 1) 
onto the first factor. Thus ¢gq is smooth and so is its inverse ¢gg. This 
shows that {(Ua,¢a)}aea is a smooth structure on B. Second, 7 is a sub- 
mersion with respect to this smooth structure, because $y 0 7 0 Wa(x,g) = x 
for alla € A, alla € Qa, and all g € G. Third, this smooth structure is com- 
patible with the quotient topology by Step 2. This proves Step 3. 


Step 4. There is only one smooth structure on B with respect to which the 
projection 7: P — B is a submersion. 


Fix any smooth structure on B for which the projection 7: P> Bisa 
submersion. Then the dimension of B is n = dim(P) — dim(G), and so the 
smooth structure consists of bijections ¢g : Ug > Qa from subsets Uz C B 
onto open sets 2, C R” such that the sets Ug cover B and the transition 
maps are diffeomorphisms between open subsets of R”. 
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We prove that the intrinsic topology on B agrees with the quotient topol- 
ogy. To see this, fix a subset U C B. Then the following are equivalent. 


(a) U is open with respect to the intrinsic topology on B. 
(b) ¢a(U NUq) is open in R” for alla € A. 

(c) t-!(U NU,) is open in P for all a€ A. 

(d) 7~!(U) is open in P. 

(e) U is open with respect to the quotient topology on B. 


The equivalence of (a) and (b) follows from the definition of the intrinsic 
topology. That (b) implies (c) follows from the three observations that the 
set m—~!(Uq) is open in P, the map ¢,07:77!(Ug) 4 Qe is continuous, 
and (¢q 07) ~'(¢a(U NUa)) = 7 '(U NU). That (c) implies (b) follows 
from the fact that the map ¢g 0 7 : t~!(Uq) + Qe is a submersion and hence 
maps the open set 7~'(U MU.) onto an open subset of Q4 (Corollary 2.6.2). 
The equivalence of (c) and (d) follows from the fact that the map 7: P + B 
is continuous and U, C B is open (both with respect to the intrinsic topology 
on B) and so 7~!(U,) is open in P for all a € A. The equivalence of (d) 
and (e) follows from the definition of the quotient topology on B. 

Now let 1: Q + P be a local slice and define the set U := 7(u(Q)) C B 


and the map @:= (70v)~!: U +Q. Then the composition 
ba° 6 = ¢,07 06: ¢(UNUg) > ba(U NU) 


is a homeomorphism between open subsets of R”. Moreover, dy 0 ¢~! is the 
composition of the smooth maps z : {x € Q| r(u(x)) € Ua} 3 wm (UU NU), 
qin (UNUg) 3 UNU,, and ¢,:UNUeg > be(U NU2). So b,°¢% is 
smooth and its derivative is everywhere bijective because 7 is a submersion 
and the kernel of dr(v(a)) is transverse to the image of di(x). Thus ¢a 0 ¢! 
is a diffeomorphism by the Inverse Function Theorem and this proves Step 4. 


Step 5. The quotient topology on B is a Hausdorff and second countable. 


Let tg: Qq 74 P for a€ A be a collection of local slices such that the 
sets Ug := 1(ta(Qa)) cover B. Then the open sets 7~!(U,) form an open 
cover of P and so there is a countable subcover. Thus B is second count- 
able. To prove that B is Hausdorff, fix two distinct elements bo, b,; € B and 
choose po, pi € P such that (po) = bo and z(p1) = b1. Then poG and piG 
are disjoint compact subsets of P and hence can be separated by disjoint 
open subsets Up, U; C P, because P is a Hausdorff space. Now for 7 = 0,1 
the set Vi := {p € P| pG C U;} is open (exercise) and contains the orbit p;G. 
Hence Wo := 7(Vo) and Wy := 7(Vj) are disjoint open subsets of B such 
that bo € Wo and b; € Wy. This proves Step 5 and Theorem 2.9.14. 
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Example 2.9.15. There are many important examples of free group actions 
and principal bundles. A class of examples arises from orthonormal frame 
bundles (§3.4). The complex projective space B = CP” arises from the 
action of the circle G = S! on the unit sphere P = S2"+! c C"+! (Exam- 
ple 2.8.5). The real projective space B = RP” arises from the action of the 
finite group G = Z/2Z on the unit sphere P = S” C Rt! (Example 2.8.6). 
The complex Grassmannian B = G;(C”) arises from the action of G = U(k) 
on the space P = F;(C”) of unitary k-frames in C” (Example 3.7.6). If G is 
a Lie group and K C G is a compact subgroup, then by Theorem 2.9.14 the 
homogeneous space G/K admits a unique smooth structure such that the 
projection 7 : G > G/K is a submersion. The example SL(2,C)/SU(2) can 
be identified with hyperbolic 3-space (§6.4.3), the example U(n)/O(n) can 
be identified with the space of Lagrangian subspaces of a symplectic vector 
space [49, Lemma 2.3.2], the example Sp(2n)/U(n) can be identified with 
Siegel upper half space or the space of compatible linear complex structures 
on asymplectic vector space [49, Lemma 2.5.12], and the example G2/SO(4) 
can be identified with the associative Grassmannian [68, Remark 8.4]. The 
last three examples go beyond the scope of the present book. 


Standing Assumption 


We have seen that all the results in the present chapter carry over to the 
intrinsic setting, assuming that the topology of M is Hausdorff and paracom- 
pact. In fact, in many cases it is enough to assume the Hausdorff property. 
However, these results mainly deal with introducing the basic concepts like 
smooth maps, embeddings, submersions, vector fields, flows, and verifying 
their elementary properties, i.e. with setting up the language for differen- 
tial geometry and topology. When it comes to the substance of the subject 
we shall deal with Riemannian metrics and they only exist on paracompact 
Hausdorff manifolds. Another central ingredient in differential topology is 
the theorem of Sard and that requires second countability. To quote Moe 
Hirsch [29]: “Manifolds that are not paracompact are amusing, but they 
never occur naturally and it is difficult to prove anything about them.” 
Thus we will set the following convention for the remaining chapters. 


We assume from now on that each intrinsic manifold M 
is Hausdorff and second countable and hence is also paracompact. 


For most of this text we will in fact continue to develop the theory for 
submanifolds of Euclidean space and indicate, wherever necessary, how to 
extend the definitions, theorems, and proofs to the intrinsic setting. 
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Chapter 3 


The Levi-Civita Connection 


For a submanifold of Euclidean space the inner product on the ambient space 
determines an inner product on each tangent space, the first fundamental 
form. The second fundamental form is obtained by differentiating the map 
which assigns to each point in M C R” the orthogonal projection onto the 
tangent space (§3.1). The covariant derivative of a vector field along a curve 
is the orthogonal projection of the derivative in the ambient space onto the 
tangent space (§3.2). We will show how the covariant derivative gives rise 
to parallel transport (§3.3), examine the frame bundle (§3.4), discuss mo- 
tions without “sliding, twisting, and wobbling”, and prove the development 
theorem (§3.5). 

In §3.6 we will see that the covariant derivative is determined by the 
Christoffel symbols in local coordinates and thus carries over to the in- 
trinsic setting. The intrinsic setting of Riemannian manifolds is explained 
in 83.7. The covariant derivative takes the form of a family of linear op- 
erators V : Vect(y) + Vect(y), one for every smooth curve y: I > M, and 
these operators are uniquely characterized by the axioms of Theorem 3.7.8. 
This family of linear operators is the Levi-Civita connection. 


3.1 Second Fundamental Form 


Let M Cc R” be a smooth m-manifold. Then each tangent space of M is an 
m-dimensional real vector space and hence is isomorphic to R’™. Thus any 
two tangent spaces TM and T,M are of course isomorphic to each other. 
While there is no canonical isomorphism from 7,M to T,M we shall see 
that every smooth curve y in M connecting p to q induces an isomorphism 
between the tangent spaces via parallel transport of tangent vectors along ¥. 
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Throughout we use the standard inner product on R” given by 
(v, w) = vjpwi + vagwe + +++ + UnWn 


for v = (U1,.--,Un) € R” and w = (wj,...,wWp) € R”. The associated Eu- 
clidean norm will be denoted by 


pl = V@,v) = et t- +e 


for v = (v1,...,Un) € R”. When M Cc R” is a smooth m-dimensional sub- 
manifold, a first observation is that each tangent space of M inherits an inner 
product from the ambient space R”. The resulting field of inner products is 
called the first fundamental form. 


Definition 3.1.1. Let MCR” be a smooth m-dimensional submanifold. 
The first fundamental form on M is the field which assigns to each p © M 
the bilinear map 
Gp: IpM x Tp>M > R 
defined by 
Op(v, w) = (v, w) (3.1.1) 
forv,w€ TM. 


A second observation is that the inner product on the ambient space also 
determines an orthogonal projection of R” onto the tangent space T,,M for 
each p € M. This projection can be represented by the matrix II(p) € R"*” 
which is uniquely determined by the conditions 


Ip) = Tie)? =i)", (3.1.2) 


and 
II(p)v = v => vé€T,M (3.1.3) 


for p € M and v € R” (see Exercise 2.6.9). 


Lemma 3.1.2. The map IL: M > R"®*” defined by (3.1.2) and (3.1.3) is 
smooth. 


Proof. This follows directly from Theorem 2.6.10 and Corollary 2.6.12. More 
explicitly, if U C M is an open set and ¢: U > Q is a coordinate chart onto 
an open subset 2 C R™ with the inverse y := @7!:Q > U, then 


-1 
H(p) = dv(o(p)) (av(o())d(H(P)))— ad(H(W)) 


for p € U and this proves Lemma 3.1.2. 
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Figure 3.1: A unit normal vector field. 


Example 3.1.3 (Gau8 map). Let M Cc R™*! be a submanifold of codi- 
mension one. Then TM is a vector bundle of rank one (Corollary 2.6.13), 
and so each fiber T,,M+ is spanned by a unit vector v(p) € R™, determined 
by 7,M up to a sign. By Theorem 2.6.10 each po € M has an open neigh- 
borhood U Cc M on which there exists a smooth map 


yiU 3 R™! 


satisfying 
v(p)LT,)M, — |v(p)|=1 (3.1.4) 


for all p € U (see Figure 3.1). Such a map v is called a Gau8 map. The 
function II: M — R”*” is in this case given by 


I(p) = 1— v(p)v(p)" (3.1.5) 
for p € U. 
Example 3.1.4. Let M = S? c R’. Then v(p) = p and so 


1—<¢? ry LZ 
T(p)=1-pp'=| -ye 1-y? —yz 
22x —zy 1-2? 


for p = (x,y,z) € S?. 
Example 3.1.5 (M6bius strip). Consider the submanifold 


x = (1+ rcos(6/2)) cos(@), 
M := ¢ (x,y,z) € R?| y = (1 + rcos(0/2)) sin(6), 
z=rsin(0/2), 7,8 ER, |r| <eé 


for ¢ > 0 sufficiently small. Show that there does not exist a global smooth 
function v: M > R? satisfying (3.1.4). 
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Example 3.1.6. Let U C R” be an open set and f : U > R"—™ beasmooth 
function such that 0 €¢ R"~” is a regular value of f and UN M = f~1(0). 
Then T,M = ker df(p) and 


11(p) = 1 ato)" (aP(w)af(v)) a(n) 
for every pe UNM. 


Example 3.1.7. Let Q Cc R™ be an open set and W:Q— M be a smooth 
embedding. Then Ty) M = imdy(a) and 


(h(a) = d(a) (ae(x)Tab(a))  dy(w)™ 
for every x € 2. 
Next we differentiate the map I: M— R”"*”" in Lemma 3.1.2. The 
derivative at p € M takes the form of a linear map 
dIl(p) : TpM — R°*" 


which, as usual, is defined by 

= 
dt} io 
for v € T,M, where y : R — M is chosen such that 7(0) = p and 4(0) = v 


(see Definition 2.2.13). We emphasize that the expression dII(p)v is a matrix 
and can therefore be multiplied by a vector in R”. 


dIl(p)v : T(q(t)) € R"™ 


Lemma 3.1.8. For allp€ M andv,w €T,M we have 
(dII(p)v) w = (dII(p)w)v € CM 


Proof. Choose a smooth path y : R > M and a vector field X : R > R” 
along y such that 


y¥O)=p, YyO)=v, X(0)=w. 
For example, we can choose X(t) := II(y(t))w. Then 
X(t) = I(y(t)) X(t) 
for every t € R. Differentiate this equation to obtain 
X(t) = W(4(é)) XH) + (a (9(t))4() X(0- (3.1.6) 

Hence 

(dIL(4(¢))() X() = (1-H) X() € Ty M~ (3.1.7) 
for every t € R and, with t = 0, we obtain (dII(p)v) w € T,Mt+. 
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Now choose a smooth map 
R? > M : (s,t) + 7(s,t) 


satisfying 


With s =t =0 we obtain 


(dII(p)w)v = (dII(p)v)w € T,»Mt+ 


and this proves Lemma 3.1.8. 


Definition 3.1.9. The collection of symmetric bilinear maps 
hy : TpM x TpM 4 TpM+, 


defined by 
hp(v, w) := (dII(p)v)w = (dII(p)w)v (3.1.8) 


forpe M andv,w € T,M is called the second fundamental form on M. 
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Example 3.1.10. Let M Cc R™*! be an m-manifold and v : M > $™ be 
a Gau8 map so that 7,M = v(p)+ for every p € M (see Example 3.1.3). 
Then H(p) = 1—v(p)v(p)' and hence 


hp(v, w) = —v(p)(dv(p)v, w) 
forp € M andv,w €T,M. 
Exercise 3.1.11. Choose a splitting R” = R™ x R”~™ and write the ele- 
ments of R” as tuples (x,y) = (41,.-.,%m;Y1,--+-;Yn—-m) Let M C R” bea 
smooth m-dimensional submanifold such that p= 0 € M and 


ToM =R™ x {0}, TyM+ ={0}xR™™. 


By the implicit function theorem, there are open neighborhoods Q Cc R” 
and V C R”~”” of zero and a smooth map f :Q — V such that 


MN(Qx V) = graph(f) = {(x, f(x)) |e € QF. 


Thus f(0) = 0 and df(0) = 0. Prove that the second fundamental form 
hp: T,M x T,M > 1 ae is given by the second derivatives of f, i.e. 


m af 
i=l ~~" 


for v,w € T,M = R™ x {0}. 


Exercise 3.1.12. Let M Cc R” be an m-manifold. Fix a point p € M and 
a unit tangent vector v € T,M so that |v| = 1 and define 


L:={p+itv+w|teR, wl TM}. 


Let 7 : (—€,¢) ~ MOL be a smooth curve such that 7(0) = p, 4(0) = v, 
and |+(t)| = 1 for all t. Prove that 


Draw a picture of M and L in the case n = 3 and m = 2. 
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X(t) 
ees 


M = 
Figure 3.2: A vector field along a curve. 


3.2 Covariant Derivative 


Definition 3.2.1. Let ICR be an open interval and let y: I> M be a 
smooth curve. A vector field along y is a smooth map X : I > R” such 
that X(t) € Ty4)M for everyt € I (see Figure 3.2). The set of smooth vector 
fields along y is a real vector space and will be denoted by 


Vect(y) == {X : I> R"|X is smooth and X(t) €TyMVteT}. 


The first derivative X(t) of a vector field along y at t € J will, in general, 
not be tangent to /. We may decompose it as a sum of a tangent vector 
and a normal vector in the form 


X(t) = U(y(t))X(é) + (1- (7) X (0), 
where II: M — R”*” is defined by (3.1.2) and (3.1.3). The tangential 


component of this decomposition plays an important geometric role. It is 
called the covariant derivative of X at t. 


Definition 3.2.2 (Covariant derivative). Let I C R be an open inter- 
val, let y: I + M be a smooth curve, and let X € Vect(y). The covariant 
derivative of X is the vector field VX © Vect(y), defined by 


VX (t) = (y(t) X(t) € TyyM (3.2.1) 
forte l. 


Lemma 3.2.3 (Gau8—Weingarten formula). The derivative of a vector 
field X along a curve y is given by 


X(t) = VX(t) + byw (4), X(0). (3.2.2) 


Here the first summand is tangent to M and the second summand is orthog- 
onal to the tangent space of M at y(t). 


Proof. This is equation (3.1.6) in the proof of Lemma 3.1.8. 
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It follows directly from the definition that the covariant derivative along 
acurve y: I > M isa linear operator V : Vect(7y) + Vect(7). The following 
lemma summarizes the basic properties of this operator. 


Lemma 3.2.4 (Covariant derivative). The covariant derivative satisfies 
the following axioms for any two open intervals I, J CR. 


(i) Let y: I + M be a smooth curve, let X: I +R be a smooth function, 
and let X € Vect(y). Then 


V(AX) =AX 4 AVX. (3.2.3) 


(ii) Let y: I+ M be a smooth curve, let a: J >I be a smooth function 
and let X € Vect(y). Then 


V(X 00) =6(VX 00). (3.2.4) 
(iii) Let y: 1 + M be a smooth curve and let X,Y € Vect(y). Then 
L(x, Y) = (VX,Y) + (X, VY). (3.2.5) 


(iv) Lety: I x J + M be a smooth map, denote by V; the covariant deriva- 
tive along the curve s++ y(s,t) (with t fixed), and denote by \, the covariant 
derivative along the curve t+ (s,t) (with s fixed). Then 

V;Ory = Vi0sY- (8.2.6) 


Proof. Part (i) follows from the Leibniz rule LAX )=AX +AX and (ii) 
follows from the chain rule 4(X 0c) = a(X oa). To prove part (iii), use 
the orthogonal projections II(7(t)) : R" + Tq) M to obtain 


d 


qe? = (X,Y) + (X,Y) 
= (X,TI(y)Y) + (7) X,Y) 
= (II(y)X, Y) + (X, II(y)Y) 


= (VX,Y) + (X,VY) 


Part (iv) holds because the second derivatives commute and this proves 
Lemma 3.2.4. 


Part (i) in Lemma 3.2.4 asserts that the operator V is what is called a 
connection, part (iii) asserts that it is compatible with the first fundamental 
form, and part (iv) asserts that it is torsion-free. Theorem 3.7.8 below as- 
serts that these conditions (together with an extended chain rule) determine 
the covariant derivative uniquely. 
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3.3. Parallel Transport 


Definition 3.3.1 (Parallel vector field). Let I C R be an interval and 
let-y: I> M be a smooth curve. A vector field X along y is called parallel 


if 
VX(t) =0 


for allt EI. 
Example 3.3.2. Assume m = k so that M Cc R™ is an open set. Then a 
vector field along a smooth curve 7: J + M is a smooth map X : I > R™. 


Its covariant derivative is equal to the ordinary derivative VX(t) = X(t) 
and hence X is is parallel if and only if it is constant. 


Remark 3.3.3. For every X € Vect(y) and every t € I we have 


VX (t) =0 —s X (t) cif Ty) M. 


In particular, ¥ is a vector field along y and V7¥(t) = I(y(t))¥(t). Hence + 
is a parallel vector field along y if and only if 7(¢) L Tyq)M for all t€ I. 
We will return to this observation in Chapter 4. 


In general, a vector field X along a smooth curve y : J + M is parallel 
if and only if X(t) is orthogonal to T,q)M for every ¢ and, by the Gau8— 
Weingarten formula (3.2.2), we have 


Vx =0 cs X =h,(¥,X). 


The next theorem shows that any given tangent vector vo € Ty(z))M extends 
uniquely to a parallel vector field along ¥. 


Theorem 3.3.4 (Existence and uniqueness). Let I C R be an interval 
andy: I — M be a smooth curve. Let to € I and vp € Ty4,)M_ be given. 
Then there is a unique parallel vector field X € Vect(y) such that X (to) = vo. 


Proof. Choose a basis €1,...,€m of the tangent space T\(;,)M and let 
X1,..-,Xm € Vect(y) 
be vector fields along y such that 
Xi(io) = ea = Peewee 


(For example choose X;(t) := II(y(t))e;.) Then the vectors X;(to) are lin- 
early independent. Since linear independence is an open condition there is a 
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constant € > 0 such that the vectors X1(t),...,Xm(t) € Tyq)M are linearly 
independent for every t € Ip := (to —¢,to +e) NI. Since T,q)M is an 
m-dimensional real vector space this implies that the vectors X;(t) form a 
basis of T,q)M for every t € Ip. We express the vector VX;(t) € Tyq)M in 
this basis and denote the coefficients by a*(t) so that 


VXi(t) = S- af (t) Xz (2). 


k=1 


The resulting functions ak : [9 — R are smooth. Likewise, if X : J > R” is 
any vector field along y, then there are smooth functions €' : Ig + R such 
that 


X(t)=S°&()Xi(t) for all t € Ip. 


The derivative of X is given by 


Y (FOX + FOX) 


i=1 


X(t) 
and the covariant derivative by 


VX) = DO (EWX) +E OVX) 


= SOEMRO + SE) SY) oF) Xu (t) 
i=1 i=1 k=1 
= (&a+ Sate) X;(t) 
k=1 i=1 
for t € Ip. Hence V X(t) = 0 if and only if 
| a(t) a(t) 
E(t) + A()E(t) = 0, A(t) = 
ay" (t) am (t) 


Thus we have translated the equation VX = 0 over the interval Jp into a time 
dependent linear ordinary differential equation. By a theorem in Analysis IT 
(see [64, Lemma 4.4.3]), this equation has a unique solution for any initial 
condition at any point in Jo. Thus we have proved that every tg € I is 
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contained in an interval Jo C J, open in the relative topology of J, such 
that, for every t; € Jo and every v1 € Ty(z,)M, there exists a unique parallel 
vector field X : [9 + R” along y|;, satisfying X(t) = v1. We formulate this 
condition on the interval Jp as a logical formula: 


Vty E€ lp Vuz € Ty(t,)M AIX € Vect(7| 1) 
such that VX =0 and X(t1) = v1. 


(33:1) 


If two J-open intervals Jo, 4 C J satisfy this condition and have nonempty 
intersection, then their union Jog U J also satisfies (3.3.1). (Prove this!) Now 
define 


J = U {Io C I| Io is an I-open interval, Ip satisfies (3.3.1), to € Io}. 


This interval J satisfies (3.3.1). Moreover, it is nonempty and, by defi- 
nition, it is open in the relative topology of J. We prove that it is also 
closed in the relative topology of J. Thus let (t;);en be a sequence in J 
converging to a point t* € J. By what we have proved above, there ex- 
ists a constant ¢ > 0 such that the interval I* := (t* —¢,t* +¢) TI satis- 
fies (3.3.1). Since the sequence (t;);en converges to t*, there exists an i ¢ N 
such that ¢; € I*. Since t; € J there exists an interval [Ij C J, open in the 
relative topology of J, that contains to and ¢; and satisfies (3.3.1). Hence 
the interval Jp U J* is open in the relative topology of J, contains tp and ¢*, 
and satisfies (3.3.1). This shows that t* €J. Thus we have proved that the 
interval J is nonempty, and open and closed in the relative topology of I. 
Hence J = I and this proves Theorem 3.3.4. 


Definition 3.3.5 (Parallel transport). Let IC R be an interval and 
let y: 1 + M be a smooth curve. For to,t € I we define the map 


y(t, to) : Ly (t9)M > Ty()M 


by &y(t, to)vo := X(t) where X € Vect(y) is the unique parallel vector field 
along y satisfying X(to) = vo. The collection of maps ®,(t,to) for t,to € I 
is called parallel transport along y¥. 


Recall the notation 
¥TM = {(s,v)|s el, ve Ty)M} 


for the pullback tangent bundle. This set is a smooth submanifold of J x R”. 
(See Theorem 2.6.10 and Corollary 2.6.13.) The next theorem summarizes 
the properties of parallel transport. In particular, the last assertion shows 
that the covariant derivative can be recovered from the parallel transport 


maps. 
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Theorem 3.3.6 (Parallel transport). Let y: I — M be a smooth curve 
on an interval I CR. 


(i) The map ®,(t, 5) : Ty(s)M — Ty)M is linear for all s,t € I. 
(ii) For all r,s,t € I we have 


®,(t,s) 0 ®,(s,r) = ®y(t,r), ®,(t,t) = id. 
(iii) For all s,t € I and allv,w € Tys)M we have 
(®,(t, s)v, ®y(t, s)w) = (v, w). 


Thus ®,(t, 8): Ty(s)M — Ty)M is an orthogonal transformation. 
(iv) If J CR is an interval and o: J > I is a smooth map, then 


Broo(tss) = &,(o(t),0(s)). 


for all s,t Ee J. 
(v) The map 


Ix¥TM > TM : (t,(s,v)) + (¢, y(t, s)v) 


as smooth. 
(vi) For all X € Vect(y) and t,to € I we have 


© By(tost) X(t) = ®,(to, t)V X(t). 
Proof. Assertion (i) holds because the sum of two parallel vector fields 
along y is again parallel and the product of a parallel vector field with a 
constant real number is again parallel. Assertion (ii) follows directly from 
the uniqueness statement in Theorem 3.3.4. 

We prove (iii). Fix a number s € J and two tangent vectors 


V,WE Ly. 
Define the vector fields X,Y € Vect(y) along 7 by 
X(t) := ®,(t, s)u, Y(t) := ®,(t, s)w. 


These vector fields are parallel. Thus, by equation (3.2.5) in Lemma 3.2.4, 
we have 


eg) = (VX,Y) + (X, VY) =0. 


Hence the function J > R: t+ (X(t), Y(¢)) is constant and this proves (iii). 
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We prove (iv). Fix an element s € J and a tangent vector v € Tyg(s))M. 
Define the vector field X along 7 by 


X(t) := €,(t,a(s))v 
for t € I. Thus X is the unique parallel vector field along y that satisfies 
X(a(s)) =v. 
Denote 7 
VY:=yoo:J7M, X := Xoa:I—>R"” 


Then X is a vector field along ¥Y and, by the chain rule, we have 


Projecting orthogonally onto the tangent space T\(,(4))M we obtain 
VX(t) = a(t)VX(a(t)) =0 


for every t € J. Hence X is the unique parallel vector field along 7 that 
satisfies X(s) = v. Thus 


dx(t, s)v = X(t) = X(a(t)) = 0, (a(t), o(s))v. 


This proves (iv). 

We prove (v). Fix a point to € J, choose an orthonormal basis €1,..., Em 
of Ty(t)M, and define X;(t) := ®,(t,to)e: fort ¢ J andi=1,...,m. Thus 
X;, € Vect(7) is the unique parallel vector field along y such that X;(to) = e;. 
Then by (iii) we have 

(Xi(t), Xj(t)) = diy 


for all i,j € {1,...,m} and all t € I. Hence the vectors X;(t),..., Xm/(t) 
form an orthonormal basis of T,(;)M for every ¢ € I. This implies that, for 
each s € J and each tangent vector v € T),)M, we have 


v= >_(Xi(s),v) Xi(s). 


i=1 
Since each vector field X; is parallel it satisfies X;(t) = ®,(t, s) X;(s). Hence 


m 


b,(t,s)v = 5 -(Xi(s), v) Xi(t) (3.3.2) 


i=1 


for all s,t € I and v € T\,)M. This proves (v) 
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We prove (vi). Let X1,..., Xm € Vect(y) be as in the proof of (v). Thus 
every vector field X along y can be written in the form 


XH=VEOK), FO) = GH), XM). 
i=1 


Since the vector fields X; are parallel we have 


for all t € I. Hence 
(to, t)X(t) = SOE H)Xilto), By (to, VX (t) = 4 E(t) Xi(to). 
i=1 i=1 


Evidently, the derivative of the first sum with respect to t is equal to the 
second sum. This proves (vi) and the theorem. 


Remark 3.3.7. For s,¢ € J we can think of the linear map 
©, (t, s)E(7(s)) :R” + TyyM CR" 


as a real n X n matrix. The formula (3.3.2) in the proof of (v) shows that 


this matrix can be expressed in the form 


m 


,(t, s)M(7(s)) = $ > X,(t)Xi(s)" eR. 
i=l 


The right hand side defines a smooth matrix valued function on J x J and 
this is equivalent to the assertion in (v). 


Remark 3.3.8. It follows from assertions (ii) and (iii) in Theorem 3.3.6 
that 
®,(t,s) | = ©,(s,t) = ©,(¢, 8)" 


for all s,t € I. Here the linear map ®,(t,s)* : T(y)M — Ty.)M is under- 
stood as the adjoint operator of ®,(t, s) : Ts) M — T,4)M with respect to 
the inner products on the two subspaces of R” inherited from the Euclidean 
inner product on the ambient space. 
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The two theorems in this section carry over verbatim to any smooth 
vector bundle EF C M x R” over a manifold. As in the case of the tangent 
bundle one can define the covariant derivative of a section of F along y as 
the orthogonal projection of the ordinary derivative in the ambient space R” 
onto the fiber Ey”). Instead of parallel vector fields one then speaks about 
horizontal sections and one proves as in Theorem 3.3.4 that there is a unique 
horizontal section along y through any point in any of the fibers E,(;,). This 
gives parallel transport maps from E,(,) to Ey) for any pair s,¢ € J and 
Theorem 3.3.6 carries over verbatim to all vector bundles E C M x R”. We 
spell this out in more detail in the case where E = TM+ C M x R” is the 
normal bundle of M. 

Let y: 1 > M be a smooth curve. A normal vector field along y is 
a smooth map Y : J + R” such that Y(t) L T\z)M for every t € I. The set 
of normal vector fields along y will be denoted by 


Vectt(y) := {Y :I > R"|Y is smooth and Y(t) L T,q)M for allt € I}. 


This is again a real vector space. The covariant derivative of a normal 
vector field Y € Vect+(y) at t € I is defined as the orthogonal projection of 
the ordinary derivative onto the orthogonal complement of TM and will 
be denoted by 

V-Y(t) := (1- I(y(t))) YQ). (3.3.3) 


Thus the covariant derivative defines a linear operator 
Vi: Vect* (7) + Vectt(7). 


There is a version of the Gauf—Weingarten formula for the covariant deriva- 
tive of a normal vector field. This is the content of the next lemma. 


Lemma 3.3.9. Let M Cc R” be a smooth m-manifold. For p © M and 
u€T,M define the linear map hp(u) : T,M > Ti by 


hp(u)v := hp(u,v) = (dII(p)u)v (3.3.4) 


forv <¢T,M. Then the following holds. 
(i) The adjoint operator hp(u)* : T»M+ — T,M is given by 


hp(u)*w = (dII(p)u)w, weTpM-. (3.3.5) 


(ii) If I C R is an interval, y: I + M is a smooth curve, and Y € Vect*(y), 
then the derivative of Y satisfies the GauB—Weingarten formula 


Y(t) =V+Y(t) — hy (4(O)*¥ (0. (3.3.6) 
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Proof. Since II(p) € R"*" is a symmetric matrix for every p € M so is the 


matrix dII(p)u for every p € M and every u € T,M. Hence 


(v, hp(u)"w) = (Rp(u)v, w) 
— ((dII(p)u) v, w) 
= (v, (dIl(p)u)w) 


for every v € T,M and every w € T,M +. This proves (i). 
To prove (ii) we observe that, for Y € Vect+(y) and t € I, we have 


IT(y(t))¥ (t) = 0. 
Differentiating this identity we obtain 
TI(y(t))¥ (t) + (Ay (4) ()) ¥ (#) = 0 


and hence 


Y(t) Y(t) — M((é))¥ (6) — (A(t) HO) VO 


V~¥(t) — hyn GOP 


for t € I. Here the last equation follows from (i) and the definition of V~. 
This proves Lemma 3.3.9. 


Theorem 3.3.4 and its proof carry over to the normal bundle T7M~. 
Thus, if y: J — M is a smooth curve, then for all s € J and w € Ty(s)M* 
there is a unique normal vector field Y € Vect+(y) such that 


ViY=0, Y(s)=w. 
This gives rise to parallel transport maps 
L . 1 L 
2. (f,8) 2 Tys)M > Ty) M 


defined by 
cl a 
©, (t, s)w := Y(t) 


for s,t € Iand w € T,5)M + where Y is the unique normal vector field along 
7 satisfying VtY =0 and Y(s) = w. These parallel transport maps satisfy 
exactly the same conditions that have been spelled out in Theorem 3.3.6 
for the tangent bundle and the proof carries over verbatim to the present 


setting. 
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3.4 The Frame Bundle 


3.4.1 Frames of a Vector Space 


Let V be an m-dimensional real vector space. A frame of V is a basis 
€1,---,€m of V. It determines a vector space isomorphism e : R™ — V via 


e€ =o ee, PSOne VERS 
i=1 


Conversely, each isomorphism e : R™ — V determines a basis e1,...,€m 
of V via e; = e(0,...,0,1,0...,0), where the coordinate 1 appears in the 
ith place. The set of vector space isomorphisms from R™” to V will be 
denoted by 


Liso(R™, V) := {e : R™ > V |e is a vector space isomorphism} . 


The general linear group GL(m) = GL(m,R) (of nonsingular real m x m- 
matrices) acts on this space by composition on the right via 


GL(m) x Liso(R™,V) > Liso(R™,V) : (a,e) H a*e :=e04. 
This is a contravariant group action in that 
a*b*e = (ba)*e, I'e=e 


for a,b € GL(m) and e € Ligo(R™,V). Moreover, the action is free, i.e. for 
all a € GL(m) and e € Ligo(R™, V), we have 


a.e=e = gS 
It is transitive in that for all e,e’ € Liso(R™,V) there is a group element 
a € GL(m) such that e’ = a*e. Thus we can identify the space Liso(R™, V) 
with the group GL(m) via the bijection 
GL(m) > Ligo(R™, V) : a a*e 
induced by a fixed element eg € Ligo(R™”,V). This identification is not 


canonical; it depends on the choice of e9. The space Ligo(R™,V) admits a 
bijection to a group but is not itself a group. 
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3.4.2 The Frame Bundle 


Definition 3.4.1 (Frame bundle). Let M C R” be a smooth m-manifold. 
The frame bundle of M is the set 


F(M) := {(p,e)|pe M,e€ F(M)>}, (3.4.1) 
where F(M), is the space of frames of the tangent space at p, i.e. 
FM) 53= Leo( RTM). 
Define a right action of GL(m) on F(M) by 
a*(p,e) = (p,a*e) = (p,e 0a) (3.4.2) 
for a € GL(m) and (p,e) € F(M). 
One can think of a frame e € Liso(R™, T>M) as a linear map from R™ 
to R” whose image is J,M and hence as an n xX m-matrix of rank m. 
The basis of T,,M associated to this frame is given by the columns of the 
matrix e € R"*’™. Thus the frame bundle F(M) of an embedded mani- 
fold M Cc R” is a subset of the Euclidean space R” x R"*™. 
Lemma 3.4.2. The frame bundle 
FUN) CARY SIRs 
is a smooth manifold of dimension m+ m?, the group action 
GL(m) x F(M) > F(M) : (a, (p,e))  a*(p,e) 
is smooth, and the projection 
a: F(M)> M 


defined by m(p,e) :=p for (p,e) € F(M) is a surjective submersion. The 
orbits of the GL(m)-action on F(M) are the fibers of this projection, i.e. 


GL(m)*(p,e) = 7 "(p) = F(M)p 


for (p,e) € F(M), and the group GL(m) acts freely and transitively on each 
of these fibers. 
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Proof. Let U C M be an M-open set. A moving frame over U is a se- 
quence of m smooth vector fields E1,..., £m € Vect(U) on U such that the 
vectors Ej (p),...,Em(p) form a basis of T,M for each pe U. Any such 
moving frame gives a bijection 


U x GL(m) > F(U) : (p,a) + a*(p, E(p)) = (p, E(p) © a), 
where 
E(p) := (Ex (p),...; Em(p)) € F(M)p 


for p€ U. This bijection (when composed with a parametrization of U) 
gives a parametrization of the open set F(U) in F(M). The assertions of 
the lemma then follow from the fact that the diagram 


U x GL(m) F(U) 


commutes. More precisely, suppose that there exists a coordinate chart 
¢:U>2 
with values in an open set 2 C R™, and denote its inverse by 
w:=o':050. 
Then the open set 
F(U) =1-"(U) = {(p,e) € F(M) |p € U} = (U x R™") 1 F(M) 
is parametrized by the map 
Q x GL(m) + F(U) : (a, a) 4 (uv (2), dy (a) oa). 
This map is amooth and so is its inverse 
F(U) + Q x GL(m) : (p,e) + (6(p), do(p) oe). 


These are the desired coordinate chart on F(M). Thus F(M) is a smooth 
manifold of dimension m +m. Moreover, in these coordinates the projec- 
tion 7: F(U) > U is the map 2 x GL(m) >: (z,a) 4 x and so 7 is a 
submersion. The remaining assertions follow directly from the definitions 
and this proves Lemma 3.4.2. 
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The frame bundle F(M) is a principal bundle over M with struc- 
ture group GL(m). More generally, a principal bundle over a manifold B 
with structure group G is a smooth manifold P equipped with a surjective 
submersion 7: P + B and a smooth contravariant action 


Gx P>P:(g,p) > pg 


by a Lie group G such that a(pg) = 7(p) for all p € P and g € G and such 
that the group G acts freely and transitively on the fiber P, := 1~!(b) for 
each b € B. In this book we shall mostly be concerned with the frame bundle 
of a manifold M and the orthonormal frame bundle. 


Definition 3.4.3. The orthonormal frame bundle of M is the set 
OM) = { (p,e) € R" x R"™™ |p € M, ime = T,M, e'e = Tnx f ; 


If we denote by e; := e(0,...,0,1,0,...,0) (with 1 as the ith argument) the 
basis of T,M induced by the isomorphism e: R™ —+ T,M, then we have 


T €1,---,€m 1S an 
ee=l = €;,€;) = 6;; = 
(ei, €3) ‘ orthonormal basis. 


Thus O(M) is the bundle of orthonormal frames of the tangent spaces T,M 
or the bundle of orthogonal isomorphisms e: R™ + T,M. It ts a principal 
bundle over M with structure group O(m). 


Exercise 3.4.4. Prove that O(M) is a submanifold of F(/) and that the 
obvious projection 7 : O(M) > M is a submersion. Prove that the action 
of GL(m) on F(M) restricts to an action of the orthogonal group O(m) 
on O(M) whose orbits are the fibers 


O(M)» := {e € R"™*™ | (p,e) € O(M)} 
= {e € Ligo(R™, TpM) | e'e = 1}. 


Hint: If 6: U > 2 is a coordinate chart on M with inverse yw: — U, 
then 


ex = dyp(x)(dp(a)"dp(a))“/? | R™ + Tyg) M 


is an orthonormal frame of the tangent space Ty,)M for every x € (2. 


3.4. THE FRAME BUNDLE 141 


3.4.3 Horizontal Lifts 


We have seen in Lemma 3.4.2 that the frame bundle F(M) is a smooth 
submanifold of R” x R"*™. Next we examine the tangent space of F(M) at 
a point (p,e) € F(M). By Definition 2.2.1, this tangent space is given by 


R—- F(M) :t (y(t), e(t)) 
is a smooth curve satisfying 
(0) = p and e(0) =e 


pe)F (M) = ¢ (4(0), e(0)) 


The next lemma gives an explicit formula for this tangent space in terms of 
the second fundamental form h, : T,M x T,M — T,M* in Definition 3.1.9. 
Compare this formula with Lemma 4.3.1 in the next chapter. 


Lemma 3.4.5. Let M C R” be a smooth m-dimensional submanifold. Then 
the tangent space of F(M) at (p,e) is given by 


Tipe) F (M) = {G a) (3.4.3) 


te aaa ae, 


(1—1(p))e = 


Proof. We prove the inclusion “C” in (3.4.3). Let (p,@) € Tipe) F(M) and 
choose a smooth curve R > F(M) : t+ (y(t), e(t)) such that 


VO)=p, eO)=e, 70) = e(0) =e. 
Fix a vector € € R™ and define the vector field X € Vect(y) by X(t) := e(t)€ 
for t € R. Then the Gauf—Weingarten formula (3.2.2) asserts that 
e(t)é = X(t) 
= VX(t) + hy (V(t), X (4) 
= W(y(t) )e()E + hay (V(t), e(E)E) 
for allt € R. Take t = 0 to obtain 


(1 — I(p) eg = hp, €€) = hp(p)e€ 


for all € € R™. This proves the inclusion “Cc” in (3.4.3). Equality holds 
because both sides of the equation are (m+m7)-dimensional linear subspaces 
of R” x R"*™. This proves Lemma 3.4.5. 


It is convenient to consider two kinds of curves in F(M), namely vertical 
curves with constant projections to M and horizontal lifts of curves in M. 
We denote by £(R™,7,M) the space of linear maps from R™ to T,M. 
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Definition 3.4.6 (Horizontal lift). Let y: R— M be a smooth curve. A 
smooth curve B:R— F(M) is called a lift of y iff 


Tmop=y. 
Any such lift has the form B(t) = (y(t), e(t)) with e(t) € Liso(R™, Tyg) M). 


The associated curve of frames e(t) of the tangent spaces T,4)M is called a 
moving frame along y. A curve 


B(t) = (y(), e(t)) € F(M) 


is called horizontal or a horizontal lift of 7 iff the vector field 


along y is parallel for every € € R™. Thus a horizontal lift of y has the form 


B(t) = (y@), ®y(¢, Oe) (3.4.4) 
for some e€ € Ligo(R™, Tyo) M). 


Lemma 3.4.7. (i) The tangent space of F(M) at (p,e) € F(M) is the direct 
sum 


Tp,e)F (M) = Hope) ® Vine) 
of the horizontal space 
Hye) := {(v, hp(v)e) |v € TM} (3.4.5) 
and the vertical space 
Vp,e) = {0} x L(R™, Ty). (3.4.6) 
(ii) The vertical space Vip) at (p,e) € F(M) is the kernel of the linear map 


dr(p,e) : Tipe)F(M) > TyM. 
(iii) A curve B:R— F(M) is horizontal if and only if it is tangent to the 
horizontal spaces, t.e. B(t) € Hg) for everyt € R. 


(iv) If 8: R— F(M) is a horizontal curve, so is a*8 for every a € GL(m). 
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Proof. The proof has four steps. 
Step 1. Let (p,e) € F(M). Then Vip.-) = ker dr(p,e) C Tipe) F(M). 
Since 7 is a submersion, the fiber 7~!(p) is a submanifold of F(M) by 
Theorem 2.2.19 and Ty,.)m | (p) = ker dr(p,e). Now let (p,@) € ker dr(p,e). 
Then there exists a vertical curve 3 : R > F(M) with 7 o 6 = p such that 
B(0) =(p,e), — B(0) = @e). 
Any such curve has the form ((t) := (p, e(t)) where e(t) € Liso(R™, TpM). 
Hence p = 0 and € = é(0) € £(R™,T,M). This shows that 
ker dr(p,e) C Vine): (3.4.7) 
Conversely, for every € € C(R™,T,M), the curve 
R > £(R”,T,M) : tr e(t) :-=e+te 

takes values in the open set Liso(R™, TpM) for ¢ sufficiently small and 
hence 6(t) := (p,e(t)) is a vertical curve with 6(0) = (0,é). Thus 

Vigey C ker dr(p,e) C T(p,e)F (M). (3.4.8) 
Combining (3.4.7) and (3.4.8) we obtain Step 1 and part (ii). 


Step 2. Let (p,e) € F(M). Then Hye) C Tip e)F(M). Moreover, every 
horizontal curve 8: R— F(M) satisfies B(t) € Hgq) for allt € R. 


Fix a tangent vector v € T,M, let y: R-— M be a smooth curve satisfy- 
ing y(0) = p and 4(0) = v, and let 8 : R > F(M) be the horizontal lift of 7 
with 6(0) = (p,e). Then 

B(t) = (v(t), e()), —e(t) = By (¢, Oe. 


Fix a vector € € R™ and consider the vector field 
X(t) := e(t}€ = ,(t, 0)ee 


along y. This vector field is parallel and hence, by the Gauf—Weingarten 
formula, it satisfies 


6(0)E = X(0) = hyio)(4(0), X (0) = hp(w)eé. 
Here we have used (3.3.4). Thus 


(v, hp(w)e) = (F(0), €(0)) = BO) € Tao) F(M) = Tepe) F(M). 


and so Hip.) C Tip.)F(M). Moreover, B(0) = (v, Ap(v)e) € Aye) = Hg(0) 
and this proves Step 2. 
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Step 3. We prove part (i). 


We have Vine) C Tipe) F(M) by Step 1 and Hy ~) C Tipe) F(M) by Step 2. 
Moreover Hip ¢) 1 Vip.e) = {O} and so Typ 6) F(M) = Hip) & Vip.e) for dimen- 
sional reasons. This proves Step 3. 

Step 4. We prove parts (iii) and (iv). 

By Step 2 every horizontal curve 8 : R > F(M) satisfies B(t) € Hg). Con- 
versely, let R > F(M): t+ B(t) = (y(t), e(t)) be a smooth curve satisfy- 
ing A(t) € Hz) for all t. Then é(t) = hyq(¥(t))e(¢) for all t. By the Gau8— 
Weingarten formula (3.2.2) this implies that the vector field X(t) = e(t)& 
along ¥ is parallel for every € € R™, so 8 is horizontal. This proves part (iii). 
Part (iv) follows from (iii) and the fact that the horizontal tangent bun- 
dle H C TF(M) is invariant under the induced action of the group GL(m) 
on T¥(M). This proves Lemma 3.4.7. 


F(M)p=" (p) 
FM) 
(p,e) 
T 
M p 


Figure 3.3: The frame bundle. 


The reason for the terminology introduced in Definition 3.4.6 is that one 
draws the extremely crude picture of the frame bundle displayed in Fig- 
ure 3.3. One thinks of F(/) as “lying over” MM. One would then represent 
the equation y=70( by the following commutative diagram: 


FU); 
B 7 
ain 


hence the word “lift”. The vertical space is tangent to the vertical line in 
Figure 3.3 while the horizontal space is transverse to the vertical space. This 
crude imagery can be extremely helpful. 
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Exercise 3.4.8. The group GL(m) acts on F(M) by diffeomorphisms. Thus 
for each a € GL(m) the map 


F(M) — F(M) : (p,e) + a*(p,€) = (p, € 0 a) 
is a diffeomorphism of F(M). The derivative of this diffeomorphism is a 
diffeomorphism of the tangent bundle TF(M) and this is called the induced 
action of GL(m) on TF(M). Prove that the horizontal and vertical sub- 
bundles are invariant under the induced action of GL(m) on TF(M). 
Exercise 3.4.9. Prove that H(p.) C T(p,-)O(M) and that 
Tpe)O(M) = Hee) ® Vine)» — Vone) = Vine) Tey (M), 

for every (p,e) € O(M). 

The following definition introduces an important class of vector fields on 
the frame bundle that will play a central role in Section 3.5. They will be 
used to prove the Development Theorem 3.5.21 in §3.5.4 below. 


Definition 3.4.10 (Basic vector field). Every vector € € R™ determines 
a vector field Be € Vect(F(M)) defined by 


Be(p,e) := (€€, hp(e€)e) (3.4.9) 
for (p,e) € F(M). This vector field is horizontal, i.e. 
Be(p, e) € pe); 


and projects to e€, 1.e. 

dr(p, e) Be(p, e) = e€ 
for all (p,e) € F(M). These two conditions determine the vector field Be 
uniquely. It is called the basic vector field corresponding to €. 
Exercise 3.4.11. (i) Prove that every basic vector field Be € Vect(F(M)) 
is tangent to the orthonormal frame bundle O(/). 
(ii) Let R > F(M) : t + (9(t), e(t)) be an integral curve of the vector 
field Be and a € GL(m). Prove that R > F(M) : t+ a*B(t) = (y(t), a*e(t)) 
is an integral curve of By-1¢. 
(iii) Prove that the vector field Be € Vect(F(M)) is complete for all € € R™ 
if and only if the restricted vector field Belocas) € Vect(O(M)) on the or- 
thonormal frame bundle is complete for all € € R”. 
Definition 3.4.12 (Complete manifold). A smoth m-manifold M Cc R" 
is called complete iff, for every smooth curve € : R — R™” and every 
element (po,eo) € F(M), there exists a smooth curve B: R— F(M) such 
that B(0) = (po, eo) and B(t) = Bey (B(t)) for allt € R. 
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3.5 Motions and Developments 


Our aim in this sections is to define motion without sliding, twisting, or 
wobbling. This is the motion that results when a heavy object is rolled, 
with a minimum of friction, along the floor. It is also the motion of the large 
snowball a child creates as it rolls it into the bottom part of a snowman. 

We shall eventually justify mathematically the physical intuition that 
either of the curves of contact in such ideal rolling may be specified arbi- 
trarily; the other is then determined uniquely. Thus for example the heavy 
object may be rolled along an arbitrary curve on the floor; if that curve is 
marked in wet ink, another curve will be traced in the object. Conversely, 
if a curve is marked in wet ink on the object, the object may be rolled so as 
to trace a curve on the floor. However, if both curves are prescribed, it will 
be necessary to slide the object as it is being rolled, if one wants to keep the 
curves in contact. 

We assume throughout this section that M and M’ are two m-dimensio- 
nal submanifolds of R”. Objects on M’ will be denoted by the same letter 
as the corresponding objects on M with primes affixed. Thus for example 
II'(p’) € R"*" denotes the orthogonal projection of R” onto the tangent 
space TM’, V’ denotes the covariant derivative of a vector field along a 
curve in M’, and 1, denotes parallel transport along a curve in M’. 


3.5.1 Motion 


Definition 3.5.1. A motion of M along M’ (on an interval J C R) is 
a triple (V, 7,7) of smooth maps 


UV: I> O(n), y:IT>M, : I> M’' 


such that 
W(t)Tya)M = Ty (ty M’ Vtel. 


Note that a motion also matches normal vectors, i.e. 
W(t)TyyyMt=TyyM' Vtel. 


Remark 3.5.2. Associated to a motion (WV, y, 7’) of M along M’ is a family 
of (affine) isometries 7, : R” — R” defined by 


W(p) = 7'(t) + V(t) (p — y(t) (3.5.1) 


for t € J and p € R”. These isometries satisfy 


WOM) =1O, dh O)TyyM =TyyM’  VteL. 
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Remark 3.5.3. There are three operations on motions. 
Reparametrization. If (V,y,7’) is a motion of M along M’ on an in- 
terval JC R and 0 : J > I is a smooth map between intervals, then the 
triple 

(Voa,yo0,7 00) 
is a motion of M along M’ on the interval J. 


Inversion. If (V,y,7’) is a motion of M along M’, then 
(Uy) 


is a motion of M’ along M. 


Composition. If (V,7,7’) is a motion of M along M’ on an interval I 
and (W’, 7,7") is a motion of M’ along M” on the same interval, then 


(Uv, 7,7") 
is a motion of M along M”. 


We now give the three simplest examples of “bad” motions; i.e. motions 
which do not satisfy the concepts we are about to define. In all three of 
these examples, p is a point of M and M’ is the affine tangent space to M 
at p: 

M' :=p+T,M ={p+v|veET,M}. 


Example 3.5.4 (Pure sliding). Take a nonzero tangent vector v € T,M 
and let 


nt) =, '(t) =pt+tv, V@) = 
Then ¥(t) = 0, 4/(t) = v £0, and so 
WH) AC). 


(See Figure 3.4.) 


Figure 3.4: Pure sliding. 
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Example 3.5.5 (Pure twisting). Let y and 7’ be the constant curves 


and take W(t) to be the identity on T,M + and any curve of rotations on the 
tangent space T,M/. As a concrete example with m = 2 and n = 3 one can 
take M to be the sphere of radius one centered at the point (0,1,0) and p 
to be the origin: 


M := {(2,y,z) € R?|2? + (y—1)? +22 =1}, p := (0,0, 0). 


Then M’ is the (z, z)-plane and A(t) is any curve of rotations in the (2, z)- 
plane, i.e. about the y-axis TM. (See Figure 3.5.) 


CT 


Pp 


M’ 


Figure 3.5: Pure twisting. 


Example 3.5.6 (Pure wobbling). This is the same as pure twisting except 
that W(t) is the identity on T,M and any curve of rotations on T,M+. As 
a concrete example with m = 1 and n = 3 one can take M to be the circle 
of radius one in the (2, y)-plane centered at the point (0,1,0) and p to be 
the origin: 


M := {(a,y,0) € R®|2?+(y-1)? =1}, p := (0,0, 0). 


Then M’ is the x-axis and W(t) is any curve of rotations in the (y, z)-plane, 
ie. about the axis M’. (See Figure 3.6.) 


M’ Pp 


Figure 3.6: Pure wobbling. 
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3.5.2 Sliding 


When a train slides on the track (e.g. in the process of stopping suddenly), 
there is a terrific screech. Since we usually do not hear a screech, this means 
that the wheel moves along without sliding. In other words the velocity of 
the point of contact in the train wheel M equals the velocity of the point 
of contact in the track M’. But the track is not moving; hence the point of 
contact in the wheel is not moving. One may explain the paradox this way: 
the train is moving forward and the wheel is rotating around the axle. The 
velocity of a point on the wheel is the sum of these two velocities. When 
the point is on the bottom of the wheel, the two velocities cancel. 


Definition 3.5.7. A motion (V,+7,7') is said to be without sliding iff it 
satisfies U(t)¥(t) = ¥'(t) for every t. 

Here is the geometric picture of the no sliding condition. As explained 
in Remark 3.5.2 we can view a motion as a smooth family of isometries 


be(p) = 9'(t) + Ut) (p — (2) 
acting on the manifold M with y(t) € M being the point of contact with M’. 
Differentiating the curve t ++ y(p) which describes the motion of the point 
p €M in the space R” we obtain 


Sail) = 1) — WHE) + HOH - 110). 
Taking p = y(to) we find 


d 


ai|., velr(to)) = 4 (to) — (to) V(to). 


t=to 
This expression vanishes under the no sliding condition. In general the 
curve t ++ ¥(7(to)) will be non-constant, but (when the motion is without 
sliding) its velocity will vanish at the instant t = to; i.e. at the instant when 
it becomes the point of contact. In other words the motion is without sliding 
if and only if the point of contact is motionless. 

We remark that, if the motion is without sliding, we have: 


7] = WO! = HO! 


so that the curves 7 and ¥’ have the same arclength: 


i‘ Lv (@)| at = [ l(t)| at 


on any interval [to,ti] C J. Hence any motion with 7 = 0 and ¥/ 4 0 is not 
without sliding (such as the example of pure sliding above). 
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Exercise 3.5.8. Give an example of a motion where |¥(t)| = |*4(t)| for 
every t but which is not without sliding. 


Example 3.5.9. We describe mathematically the motion of the train wheel. 
Let the center of the wheel move right parallel to the z-axis at height one 
and the wheel have radius one and make one revolution in 27 units of time. 
Then the track M’ is the x-axis and we take 


M := {(z,y) € R’|2?+(y-1)? =1}. 
Choose 


y(t) := (cos(t — 7/2), 1+ sin(t — 7/2)) 
= (sin(t), 1 — cos(t)), 
7'(t) = (t,0), 


and define Y(t) € GL(2) by 


v(t) =| cos(t)  sin(t) ). 


—sin(t) cos(t) 


The reader can easily verify that this is a motion without sliding. A fixed 
point po on M, say po = (0,0), sweeps out a cycloid with parametric equa- 
tions 

x =t—sin(é), y = 1-cos(t). 


(Check that («, y) = (0,0) when y = 0; ie. for t = 2nz.) 


Remark 3.5.10. These same formulas give a motion of a sphere M rolling 
without sliding along a straight line in a plane M’. Namely in coordinates 
(x,y,z) the sphere has equation 


a? + (y—1)*+27=1, 


the plane is y = 0 and the line is the z-axis. The z-coordinate of a point is 
unaffected by the motion. Note that the curve y’ traces out a straight line 
in the plane M’ and the curve ¥ traces out a great circle on the sphere M. 


Exercise 3.5.11. The operations of reparametrization, inversion, and com- 
position respect motion without sliding; i.e. if (WV, 7,7) and (W’,7’, 7”) are 
motions without sliding on an interval J and a: J > I is a smooth map 
between intervals, then the motions (Vo a,yoa,7 0a), (W~!,7’,7), and 
(WW, y,y") are also without sliding. 
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3.5.3 Twisting and Wobbling 


A motion (V,7¥,7) on an intervall J C R transforms vector fields along y 
into vector fields along y’ by the formula 


X'(t) = (WX) (t) = VE)X(t) € Ty) M’ 
for t € I and X € Vect(y); so X’ € Vect(7). 


Lemma 3.5.12. Let (V,y,7') be a motion of M along M' on an interval 
ICR. Then the following are equivalent. 


(i) The instantaneous velocity of each tangent vector is normal, i.e. fort € I 
b(t)Tyu)M C Tyw)M". 
(ii) U intertwines covariant differentiation, i.e. for X © Vect(y) 
V' (UX) = VX. 
(iii) UV transforms parallel vector fields along y into parallel vector fields 
along y’, i.e. for X € Vect(y) 
Vx=0 = V'(WX) =0. 
(iv) W intertwines parallel transport, i.e. for s,t € I and v € Tys)M 
V(t) ®,(t, s)v = B1,(t, 8) U(s)v. 
A motion that satisfies these conditions is called without twisting. 
Proof. We prove that (i) is equivalent to (ii). A motion satisfies the equation 
w(NT(Y(t)) = W/V) 


for every t € I. This restates the condition that U(t) maps tangent vectors 
of M to tangent vectors of M’ and normal vectors of M to normal vectors 
of M’. Differentiating the equation X’(t) = U(t)X(t) we obtain 


X'(t) = V(t) X(t) + W(t) X(t). 
Applying H’(9‘(t)) this gives 
VX! = UWVX + II'(y/) UX. 


Hence (ii) holds if and only if II’(y‘(t)) W(t) = 0 for every t € I. Thus we 
have proved that (i) is equivalent to (ii). That (ii) implies (iii) is obvious. 
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We prove that (iii) implies (iv). Let to € I and v9 € Ty¢,)M. Define the 
vector field X € Vect(y) by X(t) := ®,(t,to)vo for t € I and let X’:= UX. 
Then VX = 0, hence V’X’ = 0 by (iii), and hence 


X"(t) = G1,(t, to) X"(to) = ®1,(t, to) U(to)v0 


for all t € I. Since X'(t) = W(t) X(t) = V(t) ®,(t, to) vo, this implies (iv). 
We prove that (iv) implies (ii). Let X € Vect(y) and X’ := UX. By (iv) 
we have 
Py (to, t) X"(t) = W (to) Py (to, t)X (4). 


Differentiating this equation with respect to t at t = t) and using Theo- 
rem 3.3.6, we obtain V/X'(to) = U(to) VX (to). This proves the lemma. 


Lemma 3.5.13. Let (V,y,7') be a motion of M along M' on an interval 
ICR. Then the following are equivalent. 


(i) The instantaneous velocity of each normal vector is tangent, t.e. fort € I 


U(t)T. 


all 
y(t) © Ty) M’. 


(ii) UY intertwines normal covariant differentiation, i.e. for Y € Vectt (7) 
V'" (WY) = wV"y. 


(iii) UV transforms parallel normal vector fields along y into parallel normal 
vector fields along ¥', i.e. for Y € Vect+(7) 


Vty =0 = Vt (WY) =0. 


(iv) W intertwines parallel transport of normal vector fields, i.e. for s,t € I 
and w € Tigi 


V(t) + (t, s)w = Oy, (t, s)U(s)w. 


A motion that satisfies these conditions is called without wobbling. 


The proof that the four conditions in Lemma 3.5.13 are equivalent is 
word for word analogous to the proof of Lemma 3.5.12 and will be omitted. 

In summary a motion is without twisting iff tangent vectors at the point 
of contact are rotating towards the normal space and it is without wobbling 
iff normal vectors at the point of contact are rotating towards the tangent 
space. In case m = 2 and n = 3 motion without twisting means that the 
instantaneous axis of rotation is parallel to the tangent plane. 
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Remark 3.5.14. The operations of reparametrization, inversion, and com- 
position respect motion without twisting, respectively without wobbling; i.e. 
if (VW, 7,7) and (W’, 7’, y”) are motions without twisting, respectively with- 
out wobbling, on an interval J and o : J > I is a smooth map between 
intervals, then the motions (Vog,yoa,y'oa), (W~!,7,7), and (W’V, 7, 7) 
are also without twisting, respectively without wobbling. 


Remark 3.5.15. Let J C R be an interval and to € J. Given curves 
y:I— M and 7:I— M’ and an orthogonal matrix Vo € O(n) such that 
WoL (i) M = Ty (tM 
there is a unique motion (W,y,~7’) of M along M’ (with the given y and 7’) 
without twisting or wobbling satisfying the initial condition: 

UV (to) = Vo. 


Indeed, the path of matrices UV : I — O(n) is uniquely determined by the 
conditions (iv) in Lemma 3.5.12 and Lemma 3.5.13. It is given by the explicit 
formula 


U(t)v = ©, (t, to) Vo®, (to, t)M(y(t))v 
+ 8%, (t, to) Vob+ (to, t)(v — I(7(t))v) 
for t € J and v € R”. We prove below a somewhat harder result where the 


motion is without twisting, wobbling, or sliding. It is in this situation that + 
and 7 determine one another (up to an initial condition). 


(3.5.2) 


Remark 3.5.16. We can now give another interpretation of parallel trans- 
port. Given 7: R > M and v € Ty.) M take M’ to be an affine subspace of 
the same dimension as M. Let (W, 7,7’) be a motion of M along M’ without 
twisting (and, if you like, without sliding or wobbling). Let X’ € Vect(7’) 
be the constant vector field along y' (so that V’X' = 0) with value 


X"(t) = Vovo, Vo := V(to). 
Let X € Vect(y) be the corresponding vector field along y so that 
W(t) X(t) = Vovo 


Then X(t) = ®,(t,to)vo. To put it another way, imagine that M is a ball. To 
define parallel transport along a given curve ¥ roll the ball (without sliding) 
along a plane M’ keeping the curve y in contact with the plane M’. Let 9/ 
be the curve traced out in M’. If a constant vector field in the plane M’ is 
drawn in wet ink along the curve 7’, it will mark off a (covariant) parallel 
vector field along y in M. 


Exercise 3.5.17. Describe parallel transport along a great circle in a sphere. 
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3.5.4 Development 
A development is an intrinsic version of motion without sliding or twisting. 


Definition 3.5.18. A development of VM along M’ (on an interval /) 
is a triple (®,y,7/) where y: I > M andy: I > M' are smooth paths 
and ® is a family of orthogonal isomorphisms 


(t): Ty()M > Ty (ty M’ 
parametrized by t € I, such that 
B(t)7(t) = ¥'() (3.5.3) 
for allt € I and ® intertwines parallel transport, i.e. 
O(t) b(t, s) = ®1,(t, s)®(s) (3.5.4) 


for all s,t € I. In particular, the family ® of isomorphisms is smooth, i.e. 
if X is a smooth vector field along y, then the formula X‘(t) := ®(t)X(t) 
defines a smooth vector field along 7’. 


Lemma 3.5.19. Let I C R be an interval, y: I > M and7:I — M' 
be smooth curves, and ®(t) : TyjM > Ty) M' be a family of orthogonal 
isomorphisms parametrized by t © I. Then the following are equivalent. 

(i) (®, 7,7) is a development. 

(ii) ® satisfies (3.5.3) and 


V'(@X) = 6VX (3.5.5) 


for all X € Vect(7). 


(iii) There exists a motion (WV, ,7') without sliding and twisting such that 
O(t)=V()|r.u for allt €T. (3.5.6) 


(iv) There exists a motion (V,y,7') of M along M’ without sliding, twisting, 
and wobbling that satisfies (3.5.6). 


Proof. That (3.5.4) is equivalent to (3.5.5) was proved in Lemma 3.5.12. 
This (i) is equivalent to (ii). That (iv) implies (iii) and (iii) implies (i) is 
obvious. To prove that (i) implies (iv) choose any to € J and any orthogonal 
matrix Yo € O(n) such that Volz... = @(to) and define W(t) : R” > R” 
by (3.5.2). This proves Lemma 3.5.19. 
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Remark 3.5.20. The operations of reparametrization, inversion, and com- 
position yield developments when applied to developments; i.e. if (®, y, 7’) is 
a development of M along M’ on an interval I, (®’, 7’, 7) is a development 
of M’ along M” on the same interval I, and a : J — J is a smooth map of 
intervals, then the triples 


(Boo,you,yoa),  (8',7',9)), (88, 7,9”) 
are all developments. 


Theorem 3.5.21 (Development Theorem). Let po € M and to ER, 
let y': R— M' be a smooth curve, and let 


Po : Tp) M — Ty (to) M’ 


be an orthogonal isomorphism. Then the following holds. 


(i) There exists a development (®,y,7|r) on some open interval I Cc R 
containing to that satisfies the initial condition 


y(to) =po, &(to) = Bo. (3.5.7) 


(ii) Any two developments (®1,71,7'|1,) and (®2,72,'|1,) as in (i) on two 
intervals I, and Ig agree on the intersection I, MN In, i.e. 


y(t) = y(t), $1 (t) = &9(¢) 


for everyt € NI. 
(iii) If M is complete, then (i) holds with I=R. 


Proof. Let y :.R — M be any smooth curve such that 
¥(to) = Po 
and, for t € R, define the linear map 
@(t) = TyyyM > Ty (tM 


by 

P(t) = O1,(t, to) BoB, (to, t). (3.5.8) 
This is an orthogonal transformation for every ¢ and it intertwines parallel 
transport. However, in general ©(t)¥(t) will not be equal to 4/(t). 
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To construct a development that satisfies (3.5.3), we choose an orthonor- 
mal frame eo : R™ — T,.M and, for t € R, define e(t) : R” > T,q)M by 


e(t) — ®,(t, to)eo. (3.5.9) 
We can think of e(t) as a real n x m-matrix and the map 
R—->R"™”™ :t e(t) 


is smooth. In fact, the map t +> (7(t), e(t)) is a smooth path in the frame 
bundle F(M). Define the smooth map € : R > R™ by 


¥/ (t) = B,(t, to) Boen€(t). (3.5.10) 


We prove the following. 


Claim: The triple (®,7,7) is a development on an interval I Cc R if and 
only if the path t+ (y(t), e(t)) satisfies the differential equation 


(7(¢), E(t) = Bec) (1(#), e(t)) (3.5.11) 


for every t € I, where Bey) € Vect(F(M)) denotes the basic vector field 
associated to €(t) € R™ (see equation (3.4.9) ). 


The triple (®, 7,7’) is a development on J if and only if 


B(t)4(t) = 7) 


for every t € I. By (3.5.8) and (3.5.10) this is equivalent to the condition 


D,(t, to) BoB, (to, Y(t) = ¥'(t) = BY, (t, to) Boeok(t), 


hence to 
®,(to, t)¥(t) = eo€(t), 
and hence to 
Y(t) = ®y(t, to)eog(t) = e(t)E(t) (3.5.12) 


for every t € I. By (3.5.9) and the Gauf—Weingarten formula, we have 


é(t) = hy (F(t))e(t) 


for every t € R. Hence it follows from (3.4.9) that (3.5.12) is equivalent 
to (3.5.11). This proves the claim. a 

Parts (i) and (ii) follow directly from the claim. Part (iii) follows from 
the claim and Definition 3.4.12. This proves Theorem 3.5.21. 
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Remark 3.5.22. As any two developments (®1, 71, 7'|r,) and (®2, 72, y'|z) 
on two intervals J; and Ig that satisfy the initial condition (3.5.7) agree 
on 1; 11, there is a development defined on Jy; UJy. Hence there is a 
unique mazimally defined development (®,7,7y‘|r), defined on a maximal 
interval I = I(to, po, ®o), associated to the initial data to, po, Bo. 

Denote the space of initial data by 


; / 
fe {(te ®) ne fenieetCag } : er) 
define the set D C R x P by 
D := {(t, to, po, ®o) | (to, Po, Bo) € P, t € I(to, po, Bo) } (3.5.14) 
and let 
D +P: (t,to, po, ®o) + (t, y(t), B(t)), (3.5.15) 


be the map which assigns to each (to, po, ®o) € P and each t € I(t, po, Bo) 
the value at time t of the unique development (®,7,7|r) associated to the 
inital condition (to, p90, ®o) on the maximal time interval I = I(to, po, ®o). 
Then the space P has a natural structure of a smooth manifold (in the 
intrinsic setting), and it follows from Theorem 2.4.9 and the proof of Theo- 
rem 3.5.21 that D is an open subset of R x P and the map (3.5.5) is smooth. 
The smooth structure on P can be understood as follows. The space 


O(y') = {(t,e) | (o/(), e’) € OC’) } 


is the pullback of the orthonormal frame bundle O(M’) — M’ under the 
curve y': R + M’ or, equivalently, is the orthonormal frame bundle of the 
pullback tangent bundle (7/)*7M. Thus O(7’) is a smooth submanifold 
of R x R"*™. The group O(m) acts diagonally on O(7’) x O(M) and the 
action is free. Hence the quotient (O(97’) x O(M))/O(m) is a smooth man- 
ifold by Theorem 2.9.14, and it can be naturally identified with P via the 
bijection [(t, e’), (p,e)] + (t,p,e’ oe7!). 


Remark 3.5.23. The statement of Theorem 3.5.21 is essentially symmetric 
in M and M’ as the operation of inversion carries developments to develop- 
ments. Hence given 


y:R-> M, py €& MM, to €R, Bo: Ty(t,)M > Ty M’, 


we may speak of the development (®, 7, y’) corresponding to y with initial 
conditions 7 (to) = pp and ®(tp) = Bo. 


158 CHAPTER 3. THE LEVI-CIVITA CONNECTION 


Corollary 3.5.24 (Motions). Let po € M and to ER, let 7/: R- M’ be 
a smooth curve, and let Vo € O(n) be a matrix such that 


WoT M = T(tg) MM". 


Then the following holds. 


(i) There exists a motion (V,7,7|r) without sliding, twisting and wobbling 
on some open interval I C R containing to that satisfies the initial condi- 
tion (to) = Po and WV (to) — Wo. 

(ii) Any two motions as in (i) on two intervals I; and Iz agree on the 
intersection I, N Iz. 


(iii) If M is complete, then (i) holds with I=R. 


Proof. Theorem 3.5.21 and Remark 3.5.15. 


Corollary 3.5.25 (Completeness). The following are equivalent. 


(i) M is complete, i.e. for every smooth curve € : R > R™” and every 
element (po,eo) € F(M), there exists a smooth curve 8: R— F(M) such 
that B(t) = Bey) (8(t)) for allt € R and B(0) = (po, eo) (Definition 3.4.12). 
(ii) For every smooth curve € : R + R™ and every element (po, eo) € O(M), 
there is a smooth curve a: R— O(M) such that a(t) = Bey (a(t)) for ev- 
ery t € R and a(0) = (po, eo). 


(iii) For every smooth curve 7/ : R > R™, every po € M, and every orthogo- 
nal isomorphism ®o : Tp, M — R™ there exists a development (®, 7,7) of M 
along M' = R"™ on all of R that satisfies 7(0) = po and ®(0) = Bo. 


Proof. We have already noted that the basic vector fields are all tangent to 
the orthonormal frame bundle O(M) Cc F(M). Now note that if a smooth 
curve I > F(M):t+ 6(t) = (y(t), e(t)) on an interval I C R satisfies the 
differential equation 


B(t) = Be (B(E)) 


for all t, then so does the curve 
I+ F(M) +> a*B(t) = (y(t), e(t) 0a) 


for every a€ GL(m,R). Since any frame eg: R™ > T,,M can be car- 
ried to any other (in particular an orthonormal one) by a suitable ma- 
trix a € GL(m,R), this shows that (i) is equivalent to (ii). 

That (i) implies (iii) was proved in Theorem 3.5.21. 
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We prove that (iii) implies (ii). Fix a smooth map €: R > R™” and an 
element (po,e0) € O(M). Define 
Bp := €) : Ty) M > R™ 
and , 
f= | &(s)dsER”  forteR. 
0 


By (ii) there exists a development (®,~,7/) of M along R™ on all of R that 
satisfies the initial conditions 


(0) =po, ®(0) = o. 
Then 
P(t) = Boh,(0,t): TwyM>R™, = B(t) F(t) = 7'(t) = E@) 
for all t € R by Definition 3.5.18. Define 


e(t) := ©,(t,0)eo = ®(t) +: RR” + Tyy)M 


fort € R. Then (y,e) : R- F(M) is asmooth curve that satisfies the initial 
condition (7(0), e(0)) = (po, eo) and the differential equation 

V(t) = - )-*E(t) = e(t)€(t), 

E(t) = haw (Met) = haw (eé(t))e(@) 


by the Se aeecien formula. Thus 


(7(¢), €(4)) = Bey (7(4), et) 


for all t € R. This proves Corollary 3.5.25. 


It is of course easy to to give an example of a manifold which is not com- 
plete; e.g. if (®, y,7’) is any development of M along M’, then M \ {y(t1)} 
is not complete as the given development is only defined for t £ ty. In §4.6 
we give equivalent characterizations of completeness. In particular, we will 
see that any compact submanifold of R” is complete. 


Exercise 3.5.26. An affine subspace of R” is a subset of the form 


E=p+E={pt+v|vek} 


where E C R” is a linear subspace and p€ R”. Prove that every affine 
subspace of R” is a complete submanifold. 
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3.6 Christoffel Symbols 


The goal of this subsection is to examine the covariant derivative in local 
coordinates on an embedded manifold M C R” of dimension m. Let 


o:U>2 


be a coordinate chart, defined on an M-open subset U C M with values in 
an open set 2 Cc R™, and denote its inverse by 


w:=o!':QA5UCM. 


At this point it is convenient to use superscripts for the coordinates of a 
vector x € Q. Thus we write 


g = (a',...,27) E92. 


If p = (a) € U is the corresponding element of M, then the tangent space 
of M at p is the image of the linear map dy(x) : R™ — R” (Theorem 2.2.3) 


and thus two tangent vectors v,w € T,M can be written in the form 


v= ab(aje = SE" (a, 
; (3.6.1) 
w = dy(x)n = Yo SE (0) 


for € = (€1,...,€™) € R™ and 7 = (7,...,7™) € R™. Recall that the re- 
striction of the inner product in the ambient space R” to the tangent space 
is the first fundamental form g, : T,M x T,M — R (Definition 3.1.1). Thus 


gp(v, w) = (v,w) = YS Egij(a)n’, (3.6.2) 
ij=l 
where the functions g;; : 2 — R are defined by 


Gij (Z) = (3 (3), 55) forr EQ. (3.6.3) 


In other words, the first fundamental form is in local coordinates represented 
by the matrix valued function g = (Gig et iS Re 
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Figure 3.7: A vector field along a curve in local coordinates. 


Now let c= (c',...,¢”) : I Q be a smooth curve in 2, defined on an 


interval J Cc R, and consider the curve 
y=wocec:Il~M 


(see Figure 3.7 ). Our goal is to describe the operator X ++ VX on the space 
of vector fields along y in local coordinates. Let X : [ — R” be a vector 
field along y. Then 


X(t) € Ty()M = Ty (c(t) M =im (di)(c(t)) >-R”™ > R”) 
for every t € I and hence there exists a unique smooth function 
€=(é',...,€"):I 3 R™ 


such that 


x)= = Ys é*(t) AG (3.6.4) 


Differentiate this identity to obtain 


=n Ze 
i=l 


2 
+ Leman 2 (cl). 8.65) 


) anioai 
ij=1 


We examine the projection VX (t) = H(7(t))X (t) of this vector onto the tan- 
gent space of M at y(t). The first summand on the right in (3.6.5) is already 
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tangent to M. For the second summand we simply observe that the vec- 
tor II(7(x))0?w/Ox'dx! lies in tangent space Ty,)M and can therefore be 
expressed as a linear combination of the basis vectors OW/Oz!,...,0y)/Ox™. 
The coefficients will be denoted by Tk (2). Thus there exist smooth func- 
tions ry. >Q > R for 1,97,k =1,...,m such that 


2 m 
HW(2)) (0) = THe) 4 (0) (3.6.6) 


for alla € Qand alli,7 € {1,...,m}. The coefficients ry. :Q > R are called 
the Christoffel symbols associated to the coordinate chart 6: U > Q. To 
sum up we have proved the following. 


Lemma 3.6.1. Let c: 1 > Q be a smooth curve and define 
y:=woc:IloM. 


If €: I — R™ is a smooth map and X € Vect(y) is given by (3.6.4), then 
its covariant derivative at time t € I is given by 


E+ DY) TH(WM)E OPW ot (c(t), (3.6.7) 


1 ij=l 


Ma 


VX(t) = 


> 
ll 


where the rk are the Christoffel symbols defined by (3.6.6). 


Our next goal is to understand how the Christoffel symbols are deter- 
mined by the metric in local coordinates. Recall from equation (3.6.2) that 
the inner products on the tangent spaces inherited from the standard Eu- 
clidean inner product on the ambient space R” are in local coordinates 
represented by the matrix valued function 


g= (95 ej=1 :Q75 R™*™ 


=f Ou ow 
Gif = (x a), (3.6.8) 


We shall see that the Christoffel symbols are completely determined by the 
functions gj; : (2 — R. Here are first some elementary observations. 


given by 
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Remark 3.6.2. The matrix g(x) € R”*"™ is symmetric and positive definite 
for every  € 2. This follows from the fact that the matrix dz(a) € R"*” 
has rank m and the matrix g(x) is given by 

g(x) = d(x)" dy (a) 
Thus €'9(x)é = |dy)(x)€|? > 0 for all € € R™ \ {0}. 
Remark 3.6.3. For x € {2 we have det(g(x)) > 0 by Remark 3.6.2 and 


so the matrix g(x) is invertible. Denote the entries of the inverse ma- 
trix g(v)~! € R™*™ by g(x). They are determined by the condition 


Since g(a) is symmetric and positive definite, so is its inverse matrix g(x)~!. 
In particular, we have g*(x) = g(a) for all x € Q and all k,@ € {1,...,m}. 


Remark 3.6.4. Suppose that X,Y € Vect(y) are vector fields along our 
curve y= wWoc:I—>M and £,n: I > R” are defined by 


oe . 8 mo a 
x)= VeOHlem), YO = LO elo). 
= 
Then the inner product of X(t) and Y(t) is given by 


m 
(X(t), ¥@)) = YO FE Ogis(ct)) 7). 
ij=l 
Lemma 3.6.5 (Christoffel symbols). Let QC R™ be an open set and 
let gij : 2-4 R for i,7 =1,...,m be smooth functions such that each ma- 
trit (gij(%))P—-1 is symmetric and positive definite. Let rk : 2 > R be 
smooth functions for 1,97,k =1,...,m. Then the rk satisfy the conditions 


te 
Ty =T hi Ant = ‘> (oil + ginl'§) (3.6.9) 


for i,j,k,€=1,...,m if and only if they are given by 
el (O90 , O90; O93 
i= as ‘a Nis 3.6.10 
a 29 2 & On Oa ( ) 
If the rk are defined by (3.6.6) and the gi; by (3.6.8), then the Ty. sat- 
isfy (3.6.9) and hence are given by (3.6.10). 
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Proof. Suppose that the ry, are given by (3.6.6) and the gi; by (3.6.8). Let 
e:ITagQd, €,n: 13> R™ 
be smooth functions and suppose that the vector fields X,Y along the curve 
y:=woc:Il>aM 


are given by 


x)= Lee), vO= Lwro ew). 


Dropping the argument t in each term, we obtain from Remark 3.6.4 and 


Lemma 3.6.1 that 
Sale 
4,9 


(X,VY) = So gn(clé | a+ SOT ielen’e | , 
i,k jl 


(X,Y) 


(VX,Y) = Digg) | e+ OTE OES | 7. 
j,k ae 
Hence it follows from equation (3.2.5) in Lemma 3.2.4 that 


He) —(X, VY) —(VX,Y) 


~ 7 gadd Ein? + giz (CEP + Set Oi ( ioe) 


a 


- X gi (Clem — S7 gix OV Fe(c)E'n? 


ijkl 
Son olen — So oni (OTE 
1,j,k,e 
= (4 )— da givleJE )— Lame g 
i wa a FRAC me 
uJ; 


This holds for all smooth maps c: J > Qand €,7: I > R™, so the rk satisfy 
the second equation in (3.6.9). That they are symmetric in i and j is obvious. 
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To prove that (3.6.9) is equivalent to (3.6.10), define 


m 
Pej = S- 9en0',. 
k=1 


Then (3.6.9) is equivalent to 


Ogi 4 


Pej = Peja, Art Dijze + Dyie. 


and (3.6.10) is equivalent to 


Doe wt (OM 4 2905 P93 
“1 9\ aati " Oxi Ax!) 


If the I'y;; are given by (3.6.13), then they satisfy 


Peis = Pej 
and 
Ogi; Age O9;e . Ogi , OGje Ogie 
ADage + OT jue = Ox? Ori Ox? * Ox? z Ort = Ox 
Ogi; 
= 2 
Ox? 


for all i, j, 2. Conversely, if the ['y;; satisfy (3.6.12), then 


Ogi 4 


Aut — lise + Uae, 

Ogei 

Aci = Te; + Ties = Teg + Tize, 
Oge; 

i ee ee 
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(3.6.11) 


(3.6.12) 


(3.6.13) 


Take the sum of the last two minus the first of these equations to obtain 


Oge 4, Oge; 99%; 
Ori Ox? — Or 


= 2145. 


Thus (3.6.12) is equivalent to (3.6.13) and so (3.6.9) is equivalent to (3.6.10). 


This proves Lemma 3.6.5. 
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3.7 Riemannian Metrics* 


We wish to carry over the fundamental notions of differential geometry to 
the intrinsic setting. First we need an inner product on the tangent spaces 
to replace the first fundamental form in Definition 3.1.1. This is the content 
of Definition 3.7.1 and Lemma 3.7.4 below. Second we must introduce the 
covariant derivative of a vector field along a curve. With this understood all 
the definitions, theorems, and proofs in this chapter carry over in an almost 
word by word fashion to the intrinsic setting. 


3.7.1 Existence of Riemannian Metrics 


We will always consider norms that are induced by inner products. But in 
general there is no ambient space that can induce an inner product on each 
tangent space. This leads to the following definition. 


Definition 3.7.1. Let M be a smooth m-manifold. A Riemannian metric 
on M is a collection of inner products 


T,)M x T,M > R: (v,w)  gp(v,w), 
one for every p € M, such that the map 
M—R:p> g(X(p), ¥(p)) 


is smooth for every pair of vector fields X,Y € Vect(M). We will also 
denote the inner product by (v,w), and drop the subscript p if the base 
point is understood from the context. A smooth manifold equipped with a 
Riemannian metric is called a Riemannian manifold. 


Example 3.7.2. If M@ Cc R” is a smooth submanifold, then a Riemannian 
metric on M is given by restricting the standard inner product on R” to 
the tangent spaces T,M C R”. This is the first fundamental form of an 
embedded manifold (see Definition 3.1.1). 


More generally, assume that M is a Riemannian m-manifold in the in- 


trinsic sense of Definition 3.7.1 with an atlas Y = {(¢a,Ua)}aca-. Then the 
Riemannian metric g determines a collection of smooth functions 


Ga = (Ge,ig jai? Pa(Ua) +R, (3.7.1) 
one for each a € A, defined by 
E"ga(a)n:=gpl(v,w), da(p)=2, dba(p)v=£, doa(p)w =n, (3.7.2) 
for x € da(U,) and £,7 € R™. 
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Each matrix go(z) is symmetrix and positive definite. Note that the 
tangent vectors v and w in (3.7.2) can also be written in the form 


v= lexan Ww — [o, np. 
Choosing standard basis vectors 
g = &, = €; 


in R™ we obtain 


[a, eilp = dba(p) ei =: -(p) 


and hence 


doslt) = (2 (65a), 325(a4e))) (3.7.3) 


For different coordinate charts the maps gq and gg are related through the 
transition map 


pa = $80 5" : da(UaN Ug) + o6(Ua Us) 
via 
ga(x) = dbga(x)"99(b80(x))dbga(z) (3.7.4) 
for © € @a(Ua Ug). Equation (3.7.4) can also be written in the shorthand 
notation 
Jo = Pba9B 
fora, BEA. 


Exercise 3.7.3. Every collection of smooth maps 
Ja: ba(Ua) 4+ R™™*™ 


with values in the set of positive definite symmetric matrices that satis- 
fies (3.7.4) for all a, 8 € A determines a global Riemannian metric via (3.7.2). 


In this intrinsic setting there is no canonical metric on M (such as the 
metric induced by R” on an embedded manifold). In fact, it is not completely 
obvious that a manifold admits a Riemannian metric and this is the content 
of the next lemma. 
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Lemma 3.7.4. Every paracompact Hausdorff manifold admits a Rieman- 
nian metric. 


Proof. Let m be the dimension of M and let # = {(¢a, Ua) }aca be an atlas 
on M. By Theorem 2.9.9 there is a partition of unity {9a}ac4 subordinate 
to the open cover {Ua }aca. Now there are two equivalent ways to construct 
a Riemannian metric on M. 


The first method is to carry over the standard inner product on R™ to 
the tangent spaces T,,M for p € U, via the coordinate chart ¢,, multiply 
the resulting Riemannian metric on U, by the compactly supported function 
Aq, extend it by zero to all of M, and then take the sum over all a. This 
leads to the following formula. The inner product of two tangent vectors 
v,w € T,M is defined by 


= >) Aa(p)(dba(p)v, dba(p)w), (3.7.5) 


pcua 


where the sum runs over all a € A with p € Ug and the inner product is 
the standard inner product on R™. Since supp(@.) C Ua for each a and the 
sum is locally finite we find that the function 


M —R: p+ (X(p),Y(p))p 


is smooth for every pair of vector fields X,Y € Vect(M). Moreover, the right 
hand side of (3.7.5) is symmetric in v and w and is positive for v = w 4 0 
because each summand is nonnegative and each summand with 6,(p) > 0 is 
positive. Thus equation (3.7.5) defines a Riemannian metric on M. 


The second method is to define the functions 
‘a: Ga(Ug) > R™*™ 


by 
= S°0(65"(#))dbya (2) dobya(x) (3.7.6) 


yEA 


for x € d@a(Ua) where each summand is defined on ¢g(Uq M Uy) and is 
understood to be zero for x ¢ ¢ag(Ua M Uy). We leave it to the reader to 
verify that these functions are smooth and satisfy the condition (3.7.4) for 
all a, 8 € A. Moreover, the formulas (3.7.5) and (3.7.6) determine the same 
Riemannian metric on M. (Prove this!) This proves Lemma 3.7.4. 
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3.7.2. Two Examples 


Example 3.7.5 (Fubini-Study metric). The complex projective space 
carries a natural Riemannian metric, defined as follows. Identify CP” with 
the quotient of the unit sphere S?°+! Cc C"+! by the diagonal action of 
the circle S!, ie. CP" = $?"+1/51. Then the tangent space of CP” at the 
equivalence class 
[2] = [zo :---: Zn] € CP” 

of a point z = (2,...,2n) € S?"*! can be identified with the orthogonal 
complement of Cz in C"+!. Now choose the inner product on TiCP” to 
be the one inherited from the standard inner product on C+! via this 
identification. The resulting metric on CP” is called the Fubini—Study 
metric. Exercise: Prove that the action of U(n +1) on C"t! induces a 
transitive action of the quotient group 


PSU(n + 1) := U(n+1)/S" 
by isometries. If z € $?"+1, prove that the unitary matrix 
g := 2227-1 


descends to an isometry ¢ on CP” with fixed point p := [z] and dé(p) = —id. 
Show that, in the case n = 1, the pullback of the Fubini-Study metric on CP! 
under the stereographic projection 


S?\ {(0,0,1)} + CP*\ {[0 1} : (1, 42,43) 4 c _ a 


1- 3 

is one quarter of the standard metric on S?. 

Example 3.7.6. Think of the complex Grassmannian G;(C") of k-planes 
in C” as a quotient of the space 


Fi(C”) := {D eck p*D = at 


of unitary k-frames in C” by the right action of the unitary group U(k). 
The space F;,(C”) inherits a Riemannian metric from the ambient Euclidean 
space C”**, Show that the tangent space of G;,(C”) at a point A = imD, 
with D € F,(C”) can be identified with the space 


TrGp(C") = {D EC"Xk| p*D = of 


Define the inner product on this tangent space to be the restriction of the 
standard inner product on C”** to this subspace. Exercise: Prove that 
the unitary group U(n) acts on Gz(C”) by isometries. 
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3.7.3. The Levi-Civita Connection 


A subtle point in this discussion is how to extend the notion of covariant 
derivative to general Riemannian manifolds. In this case the idea of project- 
ing the derivative in the ambient space orthogonally onto the tangent space 
has no obvious analogue. Instead we shall see how the covariant derivatives 
of vector fields along curves can be characterized by several axioms and 
these can be used to define the covariant derivative in the intrinsic setting. 
An alternative, but somewhat less satisfactory, approach is to carry over 
the formula for the covariant derivative in local coordinates to the intrinsic 
setting and show that the result is independent of the choice of the coor- 
dinate chart. Of course, these approaches are equivalent and lead to the 
same result. We formulate them as a series of exercises. The details are 
straightforward. 

Assume throughout that M is a Riemannian m-manifold with an atlas 


A = (ba, Ua) }aea 


and suppose that the Riemannian metric is in local coordinates given by 
Go = (Ga,ij)ij=1 ? Pa(Ua) + R™™ 
for a € A. These functions satisfy (3.7.4) for all a, 8 € A. 


Definition 3.7.7. Let f : N — M be a smooth map between manifolds. A 
vector field along f is a collection of tangent vectors 


X(q) € Ty(qyM, 
one for each q € N, such that the map 
N—+TM :q+ (f(q), X(q)) 
is smooth. The space of vector fields along f will be denoted by Vect(f). 
As before we will not distinguish in notation between the collection of 
tangent vectors X(q) € Ty(q)M and the associated map N + TM and 
denote them both by X. The following theorem introduces the Levi-Civita 


connection as a collection of linear operators V : Vect(y) — Vect(y), one for 
each smooth curve y: I > M. 
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Theorem 3.7.8 (Levi-Civita connection). There exists a unique collec- 
tion of linear operators 


V : Vect(7y) > Vect(7) 


(called the covariant derivative), one for every smooth curve y : I > M 
on an open interval J C R, satisfying the following axioms. 


(Leibniz Rule) For every smooth curve y: J > M, every smooth function 
A: I —R, and every vector field X € Vect(7), we have 


VAs) Se AVX, (3.7.7) 


(Chain Rule) Let 2 Cc R” be an open set, let c: J + Q be a smooth curve, 
let y: Q — M beasmooth map, and let X be a smooth vector field along y. 
Denote by V;X the covariant derivative of X along the curve x’ +> (2) (with 
the other coordinates fixed). Then V;X is a smooth vector field along y and 
the covariant derivative of the vector field X oc € Vect(y 0c) is 


ViXecd=>_ HOV X (el). (3.7.8) 
j=l 


(Riemannian) For any two vector fields X,Y € Vect(7y) we have 


£ (X,Y) = (VX,Y)+(X, VY). (3.7.9) 


(Torsion-free) Let J, C R be open intervals and y : I x J ~ M be 
a smooth map. Denote by V, the covariant derivative along the curve 
st» ¥(s,t) (with t fixed) and by VY; the covariant derivative along the curve 
t+ 7(s,t) (with s fixed). Then 


Very = Vi0s7- (3.7.10) 


Proof. The proof is based on a reformulation of the axioms in local co- 
ordinates. The (Leibnitz Rule) and (Chain Rule) axioms assert that the 
covariant derivative is in local coordinates given by Christoffel symbols rk 
as in equation (3.6.7) in Lemma 3.6.1. The (Riemannian) and (Torsion-free) 
axioms assert that the Christoffel symbols satisfy the equations in (3.6.9) 


and hence, by Lemma 3.6.5, are given by (3.6.10). (See also Exercise 3.7.10.) 
This proves Theorem 3.7.8. 
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Exercise 3.7.9. The Christoffel symbols of the Riemannian metric are 
the functions I“ : ba(Ua) + R. defined by 


aij * 
el (O9a0i , Wai — Ge,ij 
ko ke ali, PGa,t) aij 
y= ya ( AP ae ) (3.7.11) 


l=1 


(see Lemma 3.6.5). Prove that they are related by the equation 


i! j 
2b ais ia Oba. a y oe ° 7) OP OF Ba 
k 


“Oak tI ~ Opiaas Ox* Oars 


for all a, 6 € A. 


Exercise 3.7.10. Denote Wo := 65! : da(Ua) + M. Prove that the covari- 
ant derivative of a vector field 


- Daly (OE (call) 


along Y = Wa ° Ca: I > M is given by 


=>) (&e + 2 rh glelt g(t) (6) oe (ca(t)). (3.7.12) 


k=). j=l 
Prove that VX is independent of the choice of the coordinate chart. 


Exercise 3.7.11. Let Q C R? be open and \: 2 > (0,00) be a smooth 
function. Let g : Q > R?*? be given by 


0@= ("9 sty): 


Compute the Christoffel symbols rk via (3.6.10). 


Exercise 3.7.12. Let ¢: S” \ {(0,0,1)} — C be the stereographic projec- 


tion, given by 
Pi P2 
o(p) = ( ) 
) 1—p3 1—ps 


Prove that the metric g : R? + R?*? has the form g(x) = \(x)11 where 


4 
(1 + |x|?) 


AGE) = for « = (2, 2”) € R’. 
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3.7.4 Basic Vector Fields in the Intrinsic Setting 


Let M be a Riemannian m-manifold with an atlas # = {(¢a,Ua)}aea: 
Then the frame bundle (3.4.1) admits the structure of a smooth manifold 
with the open cover Ug, := a (Ug) and coordinate charts 


bo : Ua + ba(Ua) x GL(m) 
given by 

balP, €) = (¢a(p), dba(p)e) - 
The derivatives of the horizontal curves in Definition 3.4.6 form a horizon- 
tal subbundle H Cc T¥(M) of the tangent bundle of the Frame bundle 


whose fibers H(p,-) over an element (p,e) € F(M) can in local coordinates 
be described as follows. Let 


p= GD, a := dda(p)e € GL(m). (3.7.13) 
and let (z,a@) € R™ x R™*™. This pair has the form 
(2,4) = dda(p,e)(B,2), — (P,2) € Hye), (3.7.14) 
if and only if 
ay =-— S > 1k, (a)8'a} (3.7.15) 
ij=l 
for k,@=1,...,m, where the functions ce : ba(Ua,) > R are the Christof- 


fel symbols defined by (3.7.11). Thus a tangent vector (p,€) € Tip¢)F(M) 
is horizontal if and only if its coordinates (Z,@) in (3.7.14) satisfy (3.7.15). 
Hence, for every vector € € R™, there exists a unique horizontal vector 
field Be € Vect(F(M)) (the basic vector field associated to €) such that 


dn(p, e)Be(p,e) = e& 
for all (p,e) € F(M). This vector field assigns to a pair (p,e) € F(M) with 
the coordinates (#,a) € R™ x GL(m) as in (3.7.13) the horizontal tangent 
vector (p,€) € Hip) C Tipe) ¥(M) whose coordinates (7,a@) € R™ x R™*™ 
satisfy (3.7.15) and % = a€. 


Exercise 3.7.13. Verify the equivalence of (3.7.14) and (3.7.15). Prove 
that the notion of a horizontal tangent vector of F(/) is independent of 
the choice of the coordinate chart. Hint: Use Exercise 3.7.9. 


Exercise 3.7.14. Examine the orthonormal frame bundle O(M) in the 
intrinsic setting. 


Exercise 3.7.15. Carry over the proofs of Theorem 3.3.4, Theorem 3.3.6, 
and Theorem 3.5.21 to the intrinsic setting. 


174 CHAPTER 3. THE LEVLCIVITA CONNECTION 


Chapter 4 


Geodesics 


This chapter introduces geodesics in Riemannian manifolds. It begins in 84.1 
by introducing geodesics as extremals of the energy and length functionals 
and characterizing them as solutions of a second order differential equation. 
In 84.2 we show that minimizing the length with fixed endpoints gives rise 
to an intrinsic distance function d: M x M —> R which induces the topol- 
ogy M inherits from the ambient space R”. 84.3 introduces the exponential 
map, §4.4 shows that geodesics minimize the length on short time intervals, 
84.5 establishes the existence of geodesically convex neighborhoods, and 84.6 
shows that the geodesic flow is complete if and only if (IW, d) is a complete 
metric space, and that in the complete case any two points are joined by a 
minimal geodesic. §4.7 discusses geodesics in the intrinsic setting. 


4.1 Length and Energy 


4.1.1 The Length and Energy Functionals 


The concept of a geodesic in a manifold generalizes that of a straight line in 
Euclidean space. A straight line has parametrizations of form t + p+ a(t)v 
where 0: R > R is a diffeomorphism and p,v € R”. Different choices of o 
yield different parametrizations of the same line. Certain parametrizations 
are preferred, for example those parametrizations which are “proportional 
to the arclength”, i.e. where o(t) = at +b for constants a,b € R, so that 
the tangent vector o(t)v has constant length. The same distinctions can 
be made for geodesics. Some authors use the term geodesic to include all 
parametrizations of a geodesic while others restrict the term to cover only 
geodesics parametrized proportional to the arclength. We follow the latter 
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course, referring to the more general concept as a “reparametrized geodesic” . 
Thus a reparametrized geodesic need not be a geodesic. 
We assume throughout that M C R” is a smooth m-manifold. 


Definition 4.1.1 (Length and energy). Let I = [a,b] CR be a com- 
pact interval with a <b and let y: I M be a smooth curve in M. The 
length L(y) and the energy E(7) are defined by 


b 
La) := | | at, (4.1.1) 


b 
E(7) = 5 y(t)|? de. (4.1.2) 

a 
A variation of y is a family of smooth curves yy : I + M, where s 
ranges over the reals, such that the map Rx I > M : (s,t) } y(t) is 
smooth and yo = y. The variation {7s}ser is said to have fixed endpoints 
iff ys(a) = y(a) and y.(b) = y(b) for alls ER. 
Remark 4.1.2. The length of a continuous function y: [a,b] > R” can 
be defined as the supremum of the numbers 4 ly(ti) — y(ts-1)| over all 
partitions a = to < t) <---<ty =) of the interval [a,b]. By a theorem in 
first year analysis [64] this supremum is finite whenever y is continuously 
differentiable and is given by (4.1.1). 


We shall sometimes suppress the notation for the endpoints of a,b € I. 
When 7(a) = p and y(b) = q we say that y is a curve from p to qg. One 
can always compose y with an affine reparametrization t/ = a + (b—a)t to 
obtain a new curve 9/(t) := y(t’) on the unit interval 0<t<1. This new 
curve satisfies L(y’) = L(y) and E(y’) = (b— a)E(y). More generally, the 
length L(y), but not the energy E(y), is invariant under reparametrization. 


Remark 4.1.3 (Reparametrization). Let J = [a,b] and I’ = [a’,b'| be 
compact intevrals. If y : J + R” is a smooth curve anda: I' >IT isa 
smooth function such that o(a’) = a, o(b’) = b, and o(t) > 0 for all t € I’, 
then 

L(yoo) = L(y). (4.1.3) 


To see this, we compute 


b’ 
Lyea)= f |e 


Suo(t)) 
Here second equation follows from the chain rule and the fact that a(t’) > 0 
for all t’ € [a’, b'], and the third equation follows from the change of variables 
formula for the Riemann integral. This proves equation (4.1.3). 


) 
d= / s(o(t)) Ja) ae’ = L(9). 


us 
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Theorem 4.1.4 (Characterization of geodesics). Let I = [a,b] C R be 
a compact interval and let y: I — M be a smooth curve. Then the following 
are equivalent. 


(i) y is an extremal of the energy functional, i.e. 


d 


=| Baj=t 
Fe (Ys) 


s=0 


for every variation {ys}ser of y with fixed endpoints. 

(ii) y is parametrized proportional to the arclength, i.e. the veloc- 
ity |\y(t)| =c> 0 is constant, and either y is constant, t.e. y(t) =p =q for 
allt EI, orc >0 andy is an extremal of the length functional, i.e. 


d 


~| L(y.) =0 
i: (7s) 


s=0 


for every variation {Ys}ser of y with fixed endpoints. 

(iii) The velocity vector of y is parallel, i.e. Vy(t) =0 for allt € TI. 

(iv) The acceleration of y is normal to M, i.e. ¥(t) 1 Tya)M for allt € I. 
(v) If (®, 7,9’) is a development of M along M' = R"™, then 7 : I + R™ is 
a straight line parametrized proportional to the arclength, i.e. 4/ = 0. 


Proof. See page 180. 


Definition 4.1.5 (Geodesic). A smooth curve y: I — M on an interval I 
is called a geodesic iff its restriction to each compact subinterval satisfies 
the equivalent conditions of Theorem 4.1.4. So y is a geodesic if and only if 


Vy7(t) =0 for allt € I. (4.1.4) 


By the Gauf—Weingarten formula (3.2.2) with X = ¥ this is equivalent to 


F(t) = hy (V4), Y@) for allt € I. (4.1.5) 


Remark 4.1.6. (i) The conditions (i) and (ii) in Theorem 4.1.4 are mean- 
ingless when J is not compact because then the curve has at most one 
endpoint and the length and energy integrals may be infinite. However, the 
conditions (iii), (iv), and (v) in Theorem 4.1.4 are equivalent for smooth 
curves on any interval, compact or not. a 

(ii) The function s+> E(y,) associated to a smooth variation is always 
smooth and so condition (i) in Theorem 4.1.4 is meaningful. However, more 
care has to be taken in part (ii) because the function s +> L(ys) need not be 
differentiable. It is differentiable at s = 0 whenever 7(t) 4 0 for all t € I. 
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4.1.2 The Space of Paths 


Fix two points p,q € M and a compact interval J = [a,b] and denote by 
Qp,q = Qpq(1) = {y: 1 4 M|7 is smooth and 7(a) = p, 7(b) = q} 


the space of smooth curves in M from p to q, defined on the interval J. Then 
the length and energy are functionals L, EF : Q»), — R and their extremal 
points can be understood as critical points as we now explain. 

We may think of the space Qp¢ as a kind of “infinite-dimensional man- 
ifold”. This is to be understood in a heuristic sense and we use these terms 
here to emphasize an analogy. Of course, the space Q,4 is not a manifold 
in the strict sense of the word. To begin with it is not embedded in any 
finite-dimensional Euclidean space. However, it has many features in com- 
mon with manifolds. The first is that we can speak of smooth curves in Qyq. 
Of course Q, 4 is itself a space of curves in M. Thus a smooth curve in Qp 4 
would then be a curve of curves, namly a map R + Qpq: 8 7s that as- 
signs to each real number s a smooth curve 7, : J > M satisfying y,(a) = p 
and ys(b) = gq. We shall call such a curve of curves smooth iff the associated 
map R x I> M : (s,t) + ¥5(t) is smooth. Thus smooth curves in Qp4 are 
the variations of 7 with fixed endpoints introduced in Definition 4.1.1. 

Having defined what we mean by a smooth curve in 2,4 we can also 
differentiate such a curve with respect to s. Here we can simply recall that, 
since M C R”, we have a smooth map R x J > R” and the derivative of 
the curve s ++ Ys in Qp4 can simply be understood as the partial derivative 
of the map (s,t) +> ys(t) with respect to s. Thus, in analogy with embed- 
ded manifolds, we define the tangent space of the space of curves Qy 4 
at y as the set of all derivatives of smooth curves R > Qp 4: 5++ Ys passing 
through ¥, i.e. 


Ys 
s=0 


a) 
Ty Qp,q = { As R > Qp¢ : 84 Ys is smooth and yo = 7} . 


Let us denote such a partial derivative by X(t) := 2 sao Yell) € Ty M. 
Thus we obtain a smooth vector field along y. Since y,(a) = p and y,(b) = q 
for all s, this vector field must vanish at t = a,b. This suggests the formula 


Ty pg = {X € Vect(y) | X(a) = 0, X(b) = 0}. (4.1.6) 


That every tangent vector of the path space (Q),_ at 7 is a vector field along 
vanishing at the endpoints follows from the above discussion. The converse 
inclusion is the content of the next lemma. 
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Lemma 4.1.7. Let p,qeEM, ye Qpq, and X € Vect(y) with X(a) = 0 
and X(b) =0. Then there exists a smooth map R > Qpq: 8 Ys such that 


0 


3s = ya(t) = X(E) for allt € I. (4.1.7) 


yo(t) = y(¢), 


Proof. The proof has two steps. 


Step 1. There exists smooth map M x I > R": (r,t) Y;(r) with compact 
support such that Y;(r) € TM for allt € I andr € M, Y;(r) =0 forallt € I 
and allr€ M\ K, and Y,(r) = ¥,(r) =0 for allr e M. 


Define Z;(r) := I(r) X(t) for t € I and r € M. Choose an open set U C R” 
such that y(I) C U and UMM is compact (e.g. take U := U,<¢<y Be (Y(t) 
for ¢ > 0 sufficiently small). Now let 8: R" — [0,1] be a smooth cutoff 
function with support in the unit ball such that 6(0) =1 and define the 
vector fields Y; by ¥;(r) := B(et(r — y(t)))Z%(r) fort € I andr € M. 


Step 2. We prove the lemma. 


The vector field Y;: “@—+ TM in Step 1 is complete for each ¢. Thus 
there exists a unique smooth map Rx I > M : (s,t)'> ys(t) such that, 
for each t € I, the curve R- M:s+)4,(t) is the unique solution of the 
differential equation £45(t) = Y;(ys(t)) with yo(t) = y(t). These maps yz 
satisfies (4.1.7) by Step 1. 


We can now define the derivative of the energy functional FE at 7 
in the direction of a tangent vector X € TQ), by 
d 
dE()X = | Bln), (4.1.8) 
5|s=0 
where s +> 7s is as in Lemma 4.1.7. Similarly, the derivative of the length 


functional L at y in the direction of X € T,Q), is defined by 


dL(y)X := a 


L(ys). (4.1.9) 
s=0 
To define (4.1.8) and (4.1.9) the functions s +> E(y,) and s+> L(y;) must 
be differentiable at s=0. This is true for E but it only holds for L 
when 4(t) 4 0 for all t € J. Second, we must show that the right hand sides 
of (4.1.8) and (4.1.9) depend only on X and not on the choice of {7s}ser. 
Third, we must verify that dE(7) : TyQp.q 3 R and dL (7) : Aq 4 R are lin- 
ear maps. This is an exercise in first year analysis (see also the proof of Theo- 
rem 4.1.4). A curve y € Qp, is is then an extremal point of E (respectively L 
when 4(t) #0 for all ¢) if and only if dE(y) = 0 (respectively dL(y) = 0). 
Such a curve is also called a critical point of E (respectively L). 
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4.1.3 Characterization of Geodesics 


Proof of Theorem 4.1.4. The equivalence of (iii) and (iv) follows directly 
from the equations V7(t) = H(y(t))¥(t) and ker(II(y(t))) = Tay 
We prove that (i) is equivalent to (iii) and (iv). Let X € T,Qp,¢ and 


choose a smooth curve of curves R > Qyq : 5+ 75 satisfying (4.1.7). Then 
the function (s,t) + |¥5(t)|? is smooth and hence 
dB()X = 4) B) 
Ue ~ ds s=0 a 
d 2 
ie = t)|° dt 
z|_,3,f, Hol 
1 ee 
=o f al bo a 
. aa (4.1.10) 


That (iii) implies (i) follows directly from this identity. To prove that (i) im- 
plies (iv) we argue indirectly and assume that there exists a point to € [0,1] 
such that 7(to) is not orthogonal to T\;,)M. Then there exists a vec- 
tor vo € Ty4))M such that (¥(to),vo) > 0. We may assume without loss 
of generality that a<tg<b. Then there exists a constant ¢ >0 such 
that a<to-—e<tot+e<band 


tp-e<t<tyte = — (A(t), U(4(t))v0) >0. 


Now choose a smooth cutoff function 6 : I — [0,1] such that 6(t) = 0 for 
all t € I with |t — to] > € and (to) = 1. Define X € Ty Qy 4 by 


X(t) := B(tM(4(t))v9 for tT. 
Then (¥(t), X(t)) > 0 for all t and (¥(to), X(to)) > 0. Hence 
b 
aE()X =~ f (lt), X(H) dt <0 


and so 7 does not satisfy (i). Thus (i) is equivalent to (iii) and (iv). 
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We prove that (i) is equivalent to (ii). Assume first that + satisfies (i). 
Then ¥ also satisfies (iv) and hence ¥(t) L T\(¢)M for all ¢ ¢ J. This implies 


Hence the function I > R : t 4 |4(t)|? is constant. Choose c > 0 such 
that |+(t)| =c. If c= 0, then y(t) is constant and so y(t) =p=gq. Ifc>0, 


then 
d , 
2 [ owl a 
s=0 Ya 


ds 

=f 2] aoa 
= far (oto. 2 
=1 f(s.) a 


a 


dL(y)X = 


4s «)) dt 


1 
= dE) x 


Thus, in the case c > 0, y is an extremal point of F if and only if it is an 
extremal point of L. Hence (i) is equivalent to (ii). 

We prove that (iii) is equivalent to (v). Let (®,7, 7) be a development 
of M along M’ =R"™. Then 4/(t) = ®(t)4(t) and £0(t)X(t) = O(t)VX(t) 
for all X € Vect(y) and allt € J. Take X = ¥ to obtain +/(t) = ®(t)V¥(t) for 
allt € J. Thus V¥ = 0 if and only if 7/ = 0. This proves Theorem 4.1.4. 


Remark 4.1.3 shows that reparametrization by a nundecreasing surjective 


map a: I’ 4 I gives rise to map 
Qnq(L) 4 ql’) :yH you 


which preserves the length functional, i.e. 


L(yoo) = L(y) 


for all y € Qpq(Z). Thus the chain rule in infinite dimensions should assert 
that if yoo is an extremal (i.e. critical) point of LZ, then y is an extremal point 
of L. Moreover, if o is a diffeomorphism, the map y 4} y 04 is bijective and 
should give rise to a bijective correspondence between the extremal points 
of L on Qp4(Z) and those on Qy4(J'). Finally, if the tangent vector field + 
vanishes nowhere, then y can be parametrized by the arclength. This is 
spelled out in more detail in the next exercise. 
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Exercise 4.1.8. Let y: I = [a,b] > M be a smooth curve such that 


4(t) £0 


b 
=f Hela. 


(i) Prove that there exists a unique diffeomorphism o : [0,7] — I such that 


for all t € J and define 


sy es = [Wo] as 


for all ¢’ € [0,7] and all t € [a,b]. Prove that 7/:= yoo: [0,7] > M is 
parametrized by the arclength, i.e. |4/(t’)| = 1 for all t’ € [0, T]. 


(ii) Prove that 


b 
ab(y)x =— f (Ve), X(O) at Vit):= HOP. (4.4.11) 


Hint: See the relevant formula in the proof of Theorem 4.1.4. 


(iii) Prove that 7 is an extremal point of L if and only if the curve 7 in 
part (i) is a geodesic. 

(iv) Prove that y is an extremal point of L if and only if there exists a 
geodesic 7’: I’ + M and a diffeomorphism a: I’ > I such that y/ = yoo. 


Next we generalize this exercise to cover the case where 7¥ is allowed to 
vanish. Recall from Remark 4.1.6 that the function s ++ L(y,) need not be 
differentiable. As an example consider the case where y = yo is constant 
(see also Exercise 4.4.12 below). 


Exercise 4.1.9. Let y: 1 — M be a smooth curve and let X € TQ) q(J). 
Choose a smooth curve of curves R > Qy (I) : s+ 7s that satisfies (4.1.7). 
Prove that the one-sided derivatives of the function s +> L(y) exist at s = 0 
and satisfy the inequalities 

sf Fo 


Exercise 4.1.10. Let (®,7,7') be a development of M along M’. Show 
that y is a geodesic in M if and only if 7 is a geodesic in M’. 


- | |x| dt < TL) _ 
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4.2 Distance 


Assume that M Cc R” is a connected smooth m-dimensional submanifold. 
Two point p,q € M are of distance |p — q| apart in the ambient Euclidean 
space R”. In this section we define a distance function which is more in- 
timately tied to M by minimizing the length functional over the space of 
curves in M with fixed endpoints. Thus it may happen that two points 
in M have a very short distance in R” but can only be joined by very long 
curves in M (see Figure 4.1). This leads to the intrinsic distance in M. 
Throughout we denote by I = (0, 1] the unit interval and, for p,q € M, by 


Qn = {y: [0,1] ~ M|7 is smooth and (0) = p, 7(1) = g} (4.2.1) 


the space of smooth paths on the unit interval joining p to g. Since M is 
connected the set Qp.4 is nonempty for all p,q € M. (Prove this!) 


Figure 4.1: Curves in M. 


Definition 4.2.1. The intrinsic distance between two points p,q € M is 
the real number d(p,q) > 0 defined by 


d(p,q) = gant L(y). (4.2.2) 


The inequality d(p,q) > 0 holds because each curve has nonnegative length 
and the inequality d(p,q) < co holds because Qyq # 0. 


Remark 4.2.2. Every smooth curve y : [0,1] > R” with endpoints (0) = p 
and 7(1) = q satisfies the inequality 


ua)= [ I4(8)| ae > [wad yah 


Thus d(p,q) > |p—q|. For y(t) :=p+t(q—p) we have equality and hence 
the straight lines minimize the length among all curves from p to q. 
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Lemma 4.2.3. The functiond: M x M — [0,00) defines a metric on M: 


(i) Ifp,q € M satisfy d(p,q) = 0, then p=q. 
(ii) For all p,q € M we have d(p,q) = d(q, p). 
(iii) For all p,q,r € M we have d(p,r) < d(p,q) + d(q,r). 


Proof. By Remark 4.2.2 we have d(p,q) > |p —4q| for all p,q € M and this 
proves part (i). Part (ii) follows from the fact that the curve 7(t) := y(1 — t) 
has the same length as y and belongs to Q4,, whenever y € Q),4. To prove 
part (iii) fix a constant ¢ > 0 and choose curves yo € Qp¢ and 7 € Qgr 
such that L(y) < d(p,q) +¢ and L(y) < d(q,r) +e. By Remark 4.1.3 we 
may assume without loss of generality that yo(1—t) = y(t) =¢ for t >0 
sufficiently small. Under this assumption the curve 


on 70(2¢), for 0<t < 1/2, 
- yi(2t—1), for1/2<t<1 


is smooth. Moreover, y(0) = p and y(1) = r and so y € Q,,. Thus 
d(p,r) < L(y) = L(yo) + L(m) < dlp, 4g) + d(q,r) + 2¢. 


Hence d(p,r) < d(p,q) + d(q,r) + 2¢ for every ¢ > 0. This proves part (iii) 
and Lemma 4.2.3. 


Remark 4.2.4. It is natural to ask if the infimum in (4.2.2) is always 
attained. This is easily seen not to be the case in general. For example, 
let M result from the Euclidean space R™ by removing a point pp. Then 
the distance d(p, q) = |p — q| is equal to the length of the line segment from p 
to q and any other curve from p to q is longer. Hence if pg is in the interior 
of this line segment, the infimum is not attained. We shall prove below that 


the infimum is attained whenever M is complete. 


Figure 4.2: A geodesic on the 2-sphere. 
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Example 4.2.5. Let 
M:=S? ={peR'| lp| = 1} 


be the unit sphere in R® and fix two points p,q € S?. Then d(p,q) is the 
length of the shortest curve on the 2-sphere connecting p and q. Such a 
curve is a segment on a great circle through p and q (see Figure 4.2) and its 
length is — 
d(p,q) = cos *((p,q)), (4.2.3) 


where (p,q) denotes the standard inner product, and we have 
0<d(p,q) <7. 


(See Example 4.3.11 below for details.) By Lemma 4.2.3 this defines a metric 
on $7. Exercise: Prove directly that (4.2.3) is a distance function on S?. 


We now have two topologies on our manifold M C R”, namely the topol- 
ogy determined by the metric d in Lemma 4.2.3 and the relative topology 
inherited from R”. The latter is also determined by a distance function, 
namely the extrinsic distance function defined as the restriction of the Eu- 
clidean distance function on R” to the subset MZ. We denote it by 


dy: MxM —+[0,00), — do(p,q) = |p al. (4.2.4) 


A natural question is if these two metrics d and dp induce the same topology 
on M. In other words is a subset U C M open with respect to do if and only 
if it is open with respect to d? Or, equivalently, does a sequence p, € M 
converge to po € M with respect to d if and only if it converges to po with 
respect to do? Lemma 4.2.7 answers this question in the affirmative. 


Exercise 4.2.6. Prove that every translation of R” and every orthogonal 


transformation preserves the lengths of curves. 


Lemma 4.2.7. For every pp © M we have 
d(p,q) _ 


p.a—po |p —q| 


Proof. See page 187. 


Lemma 4.2.8. Let po € M and let dp : Up > Qo be a coordinate chart onto 
an open subset of R™ such that its derivative dbo(po) : Ip M > R™ is an 
orthogonal transformation. Then 


pom @)= dual 


Proof. See page 188. 
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The lemmas imply that the topology M inherits as a subset of R’, the 
topology on M determined by the metric d, and the topology on M induced 
by the local coordinate systems on M are all the same. 


Corollary 4.2.9. For every subset U C M the following are equivalent. 
(i) U is open with respect to the metric d in (4.2.2). 


(ii) U is open with respect to the metric do in (4.2.4). 


(iii) For every coordinate chart ¢9 : Up > Qo of M onto an open sub- 
set Qo C R™ the set d9(Uo NU) is an open subset of R™. 


Proof. By Remark 4.2.2 we have 


lp — q| < d(p,q) (4.2.5) 


for all p,q € M. Thus the identity idjy : (M,d) > (M,do) is Lipschitz 
continuous with Lipschitz constant one and so every dp-open subset of M 
is d-open. Conversely, let U C M be a d-open subset of M and let pp € U 
and e > 0. Then, by Lemma 4.2.7, there exists a constant 6 > 0 such that 
all p,q € M with |p — po| < 6 and |q — po| < 6 satisfy 


d(p,q) < (1+ €)|p— 4. 
Since U is d-open, there exists a constant p > 0 such that 
Bp(po, 4) C U. 
With 


po := min a 
l+e 


this implies B,.(po,do) C U. Namely, if p € M satisfies 


|p — po| < po <4, 


then 
d(p,po) < (1+ €)|p — po| < (1+ €)po < p 


and so p € U. Thus U is dp-open and this proves that (i) is equivalent to (ii). 

That (ii) implies (iii) follows from the fact that each coordinate chart ¢o 
is a homeomorphism. To prove that (iii) implies (i), we argue indirectly 
and assume that U is not d-open. Then there exists a sequence p, € M \U 
that converges to an element po € U. Let $9 : Up 4 Qo be a coordinate 
chart with pp € Up. Then lim,_,.|¢o(pv) — ¢0(po)| = 0 by Lemma 4.2.8. 
Thus ¢9(Up NU) is not open and so U does not satisfy (iii). ~"E 
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Figure 4.3: Locally, M is the graph of f. 


Proof of Lemma 4.2.7. By Remark 4.2.2 the estimate |p — q| < d(p,q) holds 
for all p,q € M. The lemma asserts that, for all pp € M and all ¢ > 0, there 
exists a dg-open neighborhood Up C M of po such that all p,q € Up satisfy 


Ip — | < d(p,q) < (1+e)|p— I. (4.2.6) 


Let po € M and e > 0, and define x: R" + T,,M and y: R" > Ta by 


x(p):=I1(po)(p— po),  — y(p) := (1 — II (po)) (p — po), 


where II(po) : R” — T,,M denotes the orthogonal projection as usual. 
Then the derivative of the map z|y : M — T,,M at p = po is the iden- 
tity on T,,M. Hence the Inverse Function Theorem 2.2.17 asserts that 
the map z|y:M —T,,M is locally invertible near po. Extending this 
inverse to a smooth map from J,,/M to R” and composing it with the 
map y: M > TM, we obtain a smooth map 


ft TooM > Ty)M+ 
and an open neighborhood W C R” of po such that 
peM = y(p) = f(x(p)) 
for all p € W (see Figure 4.3). Moreover, by definition the map f/f satisfies 


f(0)=0¢eT, M+, df(0)=0:7,,M3T,,M-. 


Hence there exists a constant 6 > 0 such that, for every x € T,,M, we have 


ee 
2k =< eereewadiawie ap S222. 
OABET py M [Z| 
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Define 
Uo := {p€ MNW | |x(p)| < 6}. 


Given p,q € Up let 7 : [0,1] > M be the curve whose projection to the z-axis 
is the straight line joining x(p) to x(q), i.e. 


x(y(t)) = x(p) + t(w(q) — w(p)) =: a(t), 
y(y(t)) = Flay) = Fw) =: ¥@)- 
Then y(t) € Uo for all t € [0,1] and 


n= fle + y(t)| dt 


= i le(t) + df (w(t))@(t)| at 
< : (1+ llaf(x(t))||) [ee)| at 
0 


<(1+e) ] |&(t)| dt 
0 
= (1 +) |x(p) — «(q)| 
= (1 + ¢) |II(po)(p — 9)| 
<(1+e)|p-4l 
Hence d(p,q) < L(y) < (1+ €)|p—4| and this proves Lemma 4.2.7. 


Proof of Lemma 4.2.8. By assumption we have 
|dGo(po)v| = || 
for all v € T,,M. Fix aconstant ¢ > 0. Then, by continuity of the derivative, 
there exists a do-open neighborhood Mp C M of po such that for all p € Mo 
and all v € T,M we have 
(1 —€) |dgo(p)v| < |u| < (1 + €) |ddo(p) J . 
Thus for every curve y : [0,1] > Mo we have 
(1 —e)L(G007)) < L(y) < A +e)L(¢0 07). 

One is tempted to take the infimum over all curves ¥ : [0,1] + Mo joining 
two pints p,q € Mo to obtain the inequality 

(1 —) |do(p) — ¢0(@)| < d(p,¢) < (1 + €) |d0(p) — d0(4)I- (4.2.7) 


However, we must justify these inequalities by showing that the infimum 
over all curves in Mp agrees with the infimum over all curves in M joining 
the points p and q. 
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It suffices to show that the inequalities hold on a smaller heighbor- 
hood M, C Mo of po. Choose such a smaller neighborhood M, such that the 
open set ¢9(M;) is a convex subset of Qo. Then the right inequality in (4.2.7) 
follows by taking the curve y : [0,1] > M; from 7(0) = p to 7(1) = q such 
that ¢9 07: [0,1] + ¢d9(M,) is a straight line. To prove the left inequality 
in (4.2.7) we use the fact that Mo is d-open by Lemma 4.2.7. Hence, after 
shrinking M, if necessary, there exists a constant r > 0 such that 


po € M, e B,(po, d) C B3,(po, d) Cc Mo. 


Then, for p,q € M1 we have d(p,q) < 2r while L(y) > 4r for any curve + 
from p to q which leaves Mo. Hence the distance d(p,q) of p,q € My; is the 
infimum of the lengths L(y) over all curves ¥ : [0,1] + Mo that join y(0) = p 
to y(1) = q. This proves the left inequality in (4.2.7) and Lemma 4.2.8. 


A next question one might ask is the following. Can we choose a coor- 
dinate chart 6: U > Q on M with values in an open set 2 C R™ so that 
the length of each smooth curve y : [0,1] + U is equal to the length of the 
curve c:= ¢0 7: [0,1] ~ Q? We examine this question by considering the 
inverse map w := ¢ !:Q—U. Denote the components of x and w(x) by 


e=(cl,...,2™ EQ, V(x) =(v(2),...,~%(z)) EU. 


Given a smooth curve [0,1] 3 2: t 4 c(t) = (cl(t),...,e(t)) we can write 
the length of the composition y = ~oc: [0,1] + M in the form 
Eg 
Liwoe) = [| Foe] a 
0 
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Here the functions g;; : ( — R are defined by 


oe ee a (3 ia (2)). (4.2.8) 


4 Ox? Oxs Ox?” Ox 
v= 


Thus we have a smooth function g = (gj;) : Q > R”*™ with values in the 
positive definite matrices given by g(x) = dy)(x)'dy)(x) such that 


1 
Paes / elt) (elt) a(t) at (4.2.9) 


for every smooth curve c: [0,1] > Q. Thus the condition L(7 oc) = L(c) 
for every such curve is equivalent to 


Gig (@) = di 
for all x € Q or, equivalently, 
dyy(a)"dyy(x) = 1. (4.2.10) 


This means that ~ preserves angles and areas. The next example shows that 
for M = S? it is impossible to find such coordinates. 


Figure 4.4: A spherical triangle. 


Example 4.2.10. Consider the manifold M = S?. If there is a diffeomor- 
phism 7 : Q — U from an open set 2 C R? onto an open set U C S? that 
satisfies (4.2.10), it has to map straight lines onto arcs of great circles and 


it preserves the area. However, the area A of a spherical triangle bounded 
by three arcs on great circles satisfies the angle sum formula 


at+P+y=a7+A. 


(See Figure 4.4.) Hence there can be no such map w. 
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4.3. The Exponential Map 


4.3.1 Geodesic Spray 


The tangent bundle TM is a smooth 2m-dimensional manifold in R” x R” 
by Corollary 2.6.12. The next lemma characterizes the tangent bundle of 
the tangent bundle. Compare this with Lemma 3.4.5. 


Lemma 4.3.1. The tangent space of TM at (p,v) € TM is given by 


pe€T,M and } 
(1—I(p))0 = hp(p,v) f° 


Proof. We prove the inclusion “Cc” in (4.3.1). Let (p,0) € Tip.)TM and 
choose a smooth curve R > TM : t + (y(t), X(t)) such that 


VO)=p, XO)=v, 40)= X(0) =6. 
Then the Gau$—Weingarten formula (3.2.2) asserts that 


X(t) = VX(t) + hy (FV), X (0) 


Tw)I'M = {@ 3) ER” x R” (4.3.1) 


and hence (1 — II(7(t)))X(t) = hay (¥(t), X(t)) for allt eR. Take t=0 
to obtain (1 — II(p))v = h,(p,v). This proves the inclusion “C” in (4.3.1). 
Equality holds because both sides of the equation are 2m-dimensional linear 
subspaces of R" x R”. 


By Lemma 4.3.1 asmooth map S = (51, $2): TM — R” x R” is a vector 
field on T'M if and only if 


Si(p,v) €Tp>M, (1 —II(p)) S2(p, v) = hp(S1(p, v), v) 


for all (p,v) € TM. A special case is where $\(p,v) = v. Such vector fields 
correspond to second order differential equations on M. 


Definition 4.3.2 (Spray). A vector field S € Vect(T'M) is called a spray 
iff it has the form S(p,v) = (v,S2(p,v)) where So: TM — R” is a smooth 
map satisfying 


(1 — I(p))S2(p, v) = hp(v, v), So(p, Av) = A*So(p, v) (4.3.2) 
for all (p,v) € TM and X ER. The vector field S € Vect(TM) defined by 
S(p,v) = (v,hp(v,v)) € To »TM (4.3.3) 


forp€ M andv €T,M is called the geodesic spray. 
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4.3.2. The Exponential Map 


Lemma 4.3.3. Lety: I — M be a smooth curve on an open interval I CR. 
Then y is a geodesic if and only if the curve I+ TM :t+ (y(t), 7(t)) as 
an integral curve of the geodesic spray S in (4.3.3). 


Proof. A smooth curve I + TM : t+ (4(t), X(t)) is an integral curve of S 
if and only if 
Y(th=X(t), X(t) =hyw (X(t), XM) 


for allt € J. By equation (4.1.5), this holds if and only if 7 is a geodesic 
and y= X. 


Combining Lemma 4.3.3 with Theorem 2.4.7 we obtain the following 
existence and uniqueness result for geodesics. 


Lemma 4.3.4. Let MC R” be an m-dimensional submanifold. 


(i) For every p € M and every v € T,M there is an e > 0 and a smooth 
curve y :(—ée,€) > M such that 


v=, 7(0) = p, 4(0) =v. (4.3.4) 
(GI) Ifyi: th, > M and 2: In > M are geodesics and to € 1, N12 with 
(to) = Y2(to), 41 (to) = 42(to), 
then y1(t) = y(t) for allt EL NIy. 
Proof. Lemma 4.3.3 and Theorem 2.4.7. 


Definition 4.3.5 (Exponential map). For p€ M and v €T,M the in- 
terval 


a U Fz I is an open interval containing 0 and there is a 
a geodesic y: I + M satisfying y(0) = p, 7(0) =v 


is called the maximal existence interval for the geodesic through p in the 
direction v. For p€ M define the set V, C T,M by 


Vo(t= (0 2 M1 € tao} (4.3.5) 
The exponential map at p is the map 
exp, : Vp, > M 


that assigns to every tangent vector v € V, the point exp,(v) := (1), 
where 7: In, + M is the unique geodesic satisfying y(0) = p and ¥(0) =v. 
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Figure 4.5: The exponential map. 


Lemma 4.3.6. (i) The set 
V :={(p,v) |pEe M, vEV,>} CTM 


is open and the map V + M : (p,v) + exp,(v) is smooth. 
(ii) [fp <M andv € Vp, then 


Inn = {t € R| tu € Vp} 
and the geodesic 7 : Ip,» + M with 7(0) =p and ¥(0) = v is given by 
y(t) = exp, (tv), EE Deyeaps 


Proof. Part (i) follows directly from Lemma 4.3.3 and Theorem 2.4.9. To 
prove part (ii), fix an element p€ M and a tangent vector v eVey. and 
let 7 : Ip» + M be the unique geodesic with 7(0) = p and 4(0) = v. Fix a 
nonzero real number A and define the map 7 : aes ee — M by 


“3, (E) = (AB) fort € A lIpw. 
Then 4,(t) = A¥(At) ans 4(t) = A24(At) and hence 
Via(t) = U(ya(t))Fa(é) = 7I((At)) FOAL) = VIE) = 0 
for every t € \~'I,. This shows that 7 is a geodesic with 
ya(0) =p, — Fa(0) = Av. 


In particular, we have \~'I,, C Ip,v- Interchanging the roles of v and Av 
we obtain a ee = Ip dy. Thus 


A € tow => ae Mer => Av € Vp 


and 
(A) = y(1) = exp, (Av) 
for A € Ip». This proves Lemma 4.3.6. 
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Since exp,(0) = p by definition, the derivative of the exponential map 
at v = 0 is a linear map from 7, to itself. This derivative is the identity 
map as illustrated in Figure 4.5 and proved in the following corollary. 


Corollary 4.3.7. The map exp, : V» + M is smooth and its derivative at 
the origin is dexp,(0) =id: T,M > T,M. 


Proof. The set V, is an open subset of the linear subspace 7,M C R", 
with respect to the relative topology, and hence is a manifold. The tan- 
gent space of V, at each point is 7,M. By Lemma 4.3.6 the exponential 
map exp, : V, + M is smooth and its derivative at the origin is given by 


dexp,(0)v = exp,(tu) = ¥(0) = v, 


Ot ay 


where y : Ip» — M is once again the unique geodesic through p in the 
direction v. This proves Corollary 4.3.7. 


Corollary 4.3.8. Let p € M and, for r > 0, denote 
By(p) = {v € TyM | [vl <r}. 
Ifr > 0 is sufficiently small, then B,(p) C Vp, the set 
Ur(p) := exp,(B,(p)) 


is an open subset of M, and the restriction of the exponential map to B,.(p) 
is a diffeomorphism from B,(p) to U;(p). 


Proof. This follows directly from Corollary 4.3.7 and Theorem 2.2.17. 


Definition 4.3.9 (Injectivity radius). Let M Cc R” be a smooth m- 
manifold. The injectivity radius of M at p is the supremum of all real 
numbers r > 0 such that B,(p) C Vp and the restriction of the exponential 
map exp, to B,(p) is a diffeomorphism onto its image 


U,(p) = exp,(B,(p)). 
It will be denoted by 


B,(p) C Vp and 
inj(p) := inj(p; M) := sup 4 r > 0} exp, : B,(p) > U,(p) 
is a diffeomorphism 


The injectivity radius of M is the infimum of the injectivity radii of M 
at p over allp © M. It will be denoted by 


inj(M) := aa inj(p; M). 
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4.3.3. Examples and Exercises 
Example 4.3.10. The exponential map on R”™ is given by 
exp,(v) =p+vu for p,v € R™. 


For every p € R™ this map is a diffeomorphism from T,R™ = R™ to R™ and 
hence the injectivity radius of R™ is infinity. 


Example 4.3.11. The exponential map on S$” is given by 


sin(|v|) 
|v| 


for every p € S™ and every nonzero tangent vector v € T,$™ = pt. The re- 
striction of this map to the open ball of radius r in T;,M is a diffeomorphism 
onto its image if and only if r <a. Hence the injectivity radius of S™ 
at every point is 7. Exercise: Given p€ S” and 0AvET7,S™ =p", 
prove that the geodesic y: R- S™ with 7(0) =p and ¥(0) =v is given 
by y(t) = cos(t |v|)p + sme for t € R. Show that, in the case 0 < |v| < a 
there is no shorter curve in S”™ connecting p and gq and deduce that the 
intrinsic distance on S$” is given by d(p, q) = cos~!((p,q)) for p,q € S™ (see 


Example 4.2.5 for m = 2). 


expp(v) = cos(|v|)p + 


Example 4.3.12. Consider the orthogonal group O(n) C R”*” with the 
standard inner product (v,w) := trace(v'w) on R"*". The orthogonal 
projection II(g) : R°*" — T,O(n) is given by 


Ig) s= : (v — ) 


and the second fundamental form by 
hg(v, v) =—gu'v. 


Hence a curve y : R > O(n) is a geodesic if and only if y'7 + 4+'¥y =0 or, 
equivalently, y'¥ is constant. This shows that geodesics in O(n) have the 
form y(t) = gexp(t€) for g € O(n) and € € o(n). It follows that 


exp,(v) = gexp(g_‘v) = exp(vg*)g 


for g € O(n) and v € T,O(n). In particular, for g = 1 the exponential 
map exp, : 0(n) — O(n) agrees with the exponential matrix. 


Exercise 4.3.13. What is the injectivity radius of the 2-torus T? = $1 x $1, 
the punctured 2-plane R? \ {(0,0)}, and the orthogonal group O(n)? 
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Geodesics in Local Coordinates 


Lemma 4.3.14. Let M CR” be an m-dimensional manifold and choose a 
coordinate chart 6: U > Q with inverse 


w:=G1:050. 


Let ry :Q > R be the Christoffel symbols defined by (3.6.6) and letc: I+ Q 
be a smooth curve. Then the curve 


y:=woc:IloM 
is a geodesic if and only if c satisfies the 2nd order differential equation 
m 
f+ S TE oéé =0 (4.3.6) 
ij=l 
fork =1,...,m. 


Proof. This follows immediately from the definition of geodesics and equa- 
tion (3.6.7) in Lemma 3.6.1 with X = ¥ and € =¢. 


We remark that Lemma 4.3.14 gives rise to another proof of Lemma 4.3.4 
that is based on the existence and uniqueness of solutions of second order 
differential equations in local coordinates. 


Exercise 4.3.15. Let Q C R™ be an open set and g = (gj) : Q 4 R™*<™ 
be a smooth map with values in the space of positive definite symmetric 
matrices. Consider the energy functional 


on the space of paths c: [0,1] > Q, where L : Q x R™ — R is defined by 


12,8) = 5 D> Soul@e. (4.3.7) 


ij=l 
The Euler—Lagrange equations of this variational problem have the form 


Le On ge 3 
ae age (CCA) = a e(lO.O), R= 1. (4.3.8) 


Prove that the Euler-Lagrange equations (4.3.8) are equivalent to the geo- 
desic equations (4.3.6), where the rk. :Q > R are given by (3.6.10). 
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4.4 Minimal Geodesics 


4.4.1 Characterization of Minimal Geodesics 


Any straight line segment in Euclidean space is the shortest curve joining its 
endpoints. The analogous assertion for geodesics in a manifold M is false; 
consider for example an arc which is more than half of a great circle on a 
sphere. In this section we consider curves which realize the shortest distance 
between their endpoints. 


Lemma 4.4.1. Let I = [a,b] be a compact interval, let y:I + M be a 
smooth curve, and define p:= (a) and q:=(b). Then the following are 
equivalent. 


(i) y is parametrized proportional to the arclength, t.e. |¥(t)| = c is constant, 
andy minimizes the length, t.e. L(y) < L(7) for every smooth curve 7 in M 
joming p and q. 

(ii) y minimizes the energy, t.e. E(y) < E(o’) for every smooth curve 7’ 
in M joining p and q. 


Proof. See page 198. 


Definition 4.4.2 (Minimal geodesic). A smooth curve y:I + M ona 
compact interval IC R is called a minimal geodesic iff it satisfies the 
equivalent conditions of Lemma 4.4.1. 


Remark 4.4.3. (i) Condition (i) says that (the velocity |+| is constant 
and) L(y) = d(p,q), i-e. that 7 is a shortest curve from p to q. It is not 
precluded that there be more than one such y; consider for example the 
case where M is a sphere and p and gq are antipodal. 


(ii) Condition (ii) implies that 


d 
= E(y;) = 
7s an (Ys) =0 


for every smooth variation R x I > M:s+5 4,(t) of y with fixed endpoints. 
Hence a minimal geodesic is a geodesic. 


(iii) Finally, we remark that L(y) (but not E(7)) is independent of the 
parametrization of 7. Hence, if y is a minimal geodesic, then L(y) < L(7’) 
for every 7’ (from p to q) whereas E(y) < E (7) for those 7 defined on (an 
interval the same length as) I. 
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Proof of Lemma 4.4.1. We prove that (i) implies (ii). Let c be the (constant) 
value of |¥(t)|. Then 


L(y) =(b-a)e, = E() = 


Then, for every smooth curve 7’ : I + M with 7/(a) = p and 9/(b) = q, we 
have 


4E(y)? = L(y)” 
z 2L(7')? 


=¢ (f el iv) 


b 
2 eh a) | W(t)? at 
= 2(b — a)’ E(7) 
= 4E(y)E()’). 


Here the fourth step follows from the Cauchy—Schwarz inequality. Now 
divide by 4E(y) to obtain E(7) < E(’). 

We prove that (ii) implies (i). We have already shown in Remark 4.4.3 
that (ii) implies that y is a geodesic. It is easy to dispose of the case where 
M is one-dimensional. In that case any y minimizing E'(y) or L(y) must be 
monotonic onto a subarc; otherwise it could be altered so as to make the 
integral smaller. Hence suppose M is of dimension at least two. Suppose, 
by contradiction, that L( 7’) < L(y) for some curve y’ from p to q. Since 
the dimension of M is bigger than one, we may approximate 7 by a curve 
whose tangent vector nowhere vanishes, i.e. we may assume without loss of 
generality that ¥(t) 4 0 for all t. Then we can reparametrize y’ proportional 
to arclength without changing its length, and by a further transformation 
we can make its domain equal to J. Thus we may assume without loss of 
generality that 7/ : I > M is a smooth curve with y(a) = p and ¥(b) = q 
such that |7‘(t)| = c’ and 


(b—a)c = L(y’) < L(y) = (b-a)e. 
This implies c’ < ¢ and hence 


(b—a)c?— (b—a)c? 


BO) = < SEE = (0). 


This contradicts (ii) and proves Lemma 4.4.1. 
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4.4.2 Local Existence of Minimal Geodesics 


The next theorem asserts the existence of minimal geodesics. It shows 
that the set U,(p) = exp,(B,(p)) in Definition 4.3.9 is actually the open 
ball U,.(p) = {q € M|d(p, q) < r} whenever r < inj(p; M). 


Theorem 4.4.4 (Existence of minimal geodesics). Let M Cc R” be a 
smooth m-manifold, fix a point p € M, and let r > 0 be smaller than the 
injectivity radius of M at p. Letv © T,M such that |v| <r. Then 


d(p, q) = |v, q := exp,(v), 


and a curve y € Qyq has minimal length L(y) = |v| if and only if there is a 
smooth map 2 : [0,1] — [0,1] satisfying 


B0)=0, pj=1, B20 
such that y(t) = exp,(G(t)v) forO<t<1. 


Proof. See page 201. 


Uy 


Figure 4.6: The Gau8 Lemma. 


Lemma 4.4.5 (Gau8 Lemma). Let M, p, r be as in Theorem 4.4.4, 
let I CR be an open interval, and let w: I — Vp be a smooth curve whose 
norm 
Jw(t)| =r 
is constant. Define 
a(s,t) := exp,(sw(t)) 
for (s,t) € Rx TI with sw(t) € Vp. Then 


da da\ _ 4 
Os’ Ot/ 


Thus the geodesics through p are orthogonal to the boundaries of the embed- 
ded balls U,(p) in Corollary 4.3.8 (see Figure 4.6). 
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Proof of Lemma 4.4.5. For every t € I we have 
a(0,t) = exp,(0) = p 
and so the assertion holds for s = 0, i.e. 


(20.9, 20.0) =0. 


Moreover, each curve s ++ a(s,t) is a geodesic, i.e. 


By Theorem 4.1.4, the function 
Oa 
t+ |>—(s,t 
s | a (5; ] 


is constant for every t, so that 


@ (0a da _ 
Os \ Os’ Ot 7 


= 0. 


Since the function (2, a) vanishes for s = 0 we obtain 


(Z(t Slet)) =0 


for all s and t. This proves Lemma 4.4.5. 
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Proof of Theorem 4.4.4. Let r > 0 be as in Corollary 4.3.8 and let v € T,M 
such that 0 < |v| =: ¢ < r. Denote q := exp,(v) and let 7 € Np. Assume 
first that 


y(t) € exp, (Be(p)) = Ue Vt e [0,1]. 


Then there is a unique smooth function [0,1] > T,M : t+ v(t) such that 
|u(t)| < € and y(t) = exp,(v(t)) for every t. The set 


T= {t € [0,1] | y(t) # p} = {t € [0,1] | vt) FO} c (0,0) 


is open in the relative topology of (0, 1]. Thus J is a union of open intervals 
in (0,1) and one half open interval containing 1. Define £6 : [0,1] > [0,1] 
and w: I —T,M by 


and 
lw(t)| =e, — y(t) = exp, (8(t)w(¢)) 
for all t € I. We prove that L(y) >. To see this let a: [0,1] x I > M be 
the map of Lemma 4.4.5, i.e. 
a(s,t) := exp,(sw(t)). 
Then y(t) = a(G(t),t) and hence 


alt) = BOZO, + 20,1 


for every t > 0. Hence it follows from Lemma 4.4.5 that 


2 
+ 


2 
2 > B(t)?e? 


5 (3(0)2) 


for every t € I. Hence 


La)= f wola= frolaze [po ae faoa=c 
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Here the last equation follows by applying the fundamental theorem of cal- 
culus to each interval in J and using the fact that 6(0) = 0 and §(1) = 1. 
If L(y) =, we must have 


O 
Fy (88) =0, £B(t)>0 for all t € I. 
Thus J is a single half open interval containing 1 and on this interval the 
0a 


condition 4* (A(t), t) = 0 implies w(t) = 0. Since w(1) = v we have w(t) = v 
for every t € I. Hence y(t) = exp,((t)v) for every t € [0,1]. It follows 
that 8 is smooth on the closed interval [0,1] (and not just on J). Thus we 
have proved that every 7 € Qp,q with values in U. has length L(y) > ¢ with 
equality if and only if y is a reparametrized geodesic. But ify € Qp,, does not 
take values only in U-, there must be a T € (0,1) such that 7([0,T]) C U- 
and 7(T) € OU-. Then L(y\o,7)) = €, by what we have just proved, 
and L(¥|;71)) > 0 because the restriction of y to [7,1] cannot be constant; 
so in this case we have L(y) > ¢. This proves Theorem 4.4.4. 


The next corollary gives a partial answer to our problem of finding length 
minimizing curves. It asserts that geodesics minimize the length locally. 


Corollary 4.4.6. Let MCR” be a smooth m-manifold, let IC R be an 
open interval, and let y: 1 + M be a geodesic. Fix a point tp) € I. Then 
there exists a constant e > 0 such that 


pHee Pet ste = L(y|[s,t)) = a(y(s), Y(t)). 


Proof. Since ¥ is a geodesic its derivative has constant norm |7(t)| = c (see 
Theorem 4.1.4). Choose 6 > 0 so small that the interval [to — 6, to + 6] is 
contained in J. Then there is a constant r > 0 such that r < inj(7(t)) 
whenever |t — to| < 6. Choose ¢ > 0 such that 


E<d, 2ec <1. 
Iftp —E<s<t<to+e, then 
y(t) = exp,s) ((t — 8)¥(s)) 


and 
|(t — s)¥(s)| = |t — s|c < 2ec < r < inj(7(s)). 


Hence it follows from Theorem 4.4.4 that 


L(l[s,4)) = lt — sle = a(7(s), 1@))- 


This proves Corollary 4.4.6. 
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4.4.3. Examples and Exercises 


Exercise 4.4.7. How large can the constant ¢ in Corollary 4.4.6 be chosen 
in the case M = S?? Compare this with the injectivity radius. 


Remark 4.4.8. We conclude from Theorem 4.4.4 that 
S(p) = {q € M|d(p,q) =r} = exp, ({v € TpM | |v| = r}) (4.4.1) 


for 0<r<inj(p;M). The Gau8 Lemma 4.4.5 shows that the geodesic 
rays [0,1] + M:s++ exp,(sv) emanating from p are the orthogonal tra- 
jectories to the concentric spheres S;(p). 


Exercise 4.4.9. Let 
M CR? 


be of dimension two and suppose that M is invariant under the (orthogonal) 
reflection about some plane E C R?. Show that E intersects M in a geodesic. 
(Hint: Otherwise there would be points p,q € M very close to one an- 
other joined by two distinct minimal geodesics.) Conclude for example that 
the coordinate planes intersect the ellipsoid (x/a)? + (y/b)? + (z/c)? = 1 in 
geodesics. 


Exercise 4.4.10. Choose geodesic normal coordinates near p € M via 


q = XP, (> “a 
i=l 


where €1,...,€m is an orthonormal basis of T,M (see Corollary 4.5.4 below). 
Then we have x’(p) = 0 and 


m 
B,(p) = {qe M|d(p,q) <r} = f € M| S-|z“(@)|’ < et (4.4.2) 
i=1 
for 0 <r <inj(p;M). Hence Theorem 4.5.3 below asserts that B,(p) is 
convex for r > 0 sufficiently small. ar 
(i) Show that it can happen that a geodesic in B,(p) is not minimal. Hint: 
Take M to be the hemisphere {(z,y,z) € R?|a? + y? +27 =1, z > 0} to 
gether with the disc {(x, y, z) € R°?| 2? + y? < 1, z = 0}, but smooth the cor- 
ners along the circle 2? + y? =1, z = 0. Take p = (0,0,1) and r = 7/2. 
(ii) Show that, if r >0 is sufficiently small, then the unique geodesic y 
in B,(p) joining two points q,q € B,(p) is minimal and that in fact any 
curve 7’ from q to qd’ which is not a reparametrization of 7¥ is strictly longer, 


i.e. L(7’') > L(y) =d(q,¢’). 
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Exercise 4.4.11. Let y : I = [a,b] + M be a smooth curve with end- 
points y(a) = p and y(b) = q and nowhere vanishing derivative, i.e. y(t) 4 0 
for all t € J. Prove that the following are equivalent. 
(i) The curve 7 is an extremal of the length functional, i.e. every 
smooth map Rx I > M:(s,t)> y(t) with y,(a) =p and y,(b) = q for 
all s satisfies ; 

qs Os) La 0. 
(ii) The curve y is a reparametrized geodesic, i.e. there exists a smooth 
map o: [a,b] > [0,1] with o(a) = 0, o(b) = 1, o(t) > 0 for allt EJ, anda 
vector v € T,M such that 


q=exp,(v), y(t) = exp,(o(t)v) 


for all t€ 7. (We remark that the hypothesis +(t) 4 0 implies that o is 
actually a diffeomorphism, i.e. a(t) > 0 for all t € I.) 


(iii) The curve y minimizes the length functional locally, i.e. there ex- 
ists an € > 0 such that L(7y|js5.4) = d(y(s), (t)) for every closed subinter- 
val [s,t] C I of length t—s <e. 


It is often convenient to consider curves 7 where 4(t) is allowed to vanish 
for some values of t; then y cannot (in general) be parametrized by arclength. 
Such a curve 7: J > M can be smooth (as a map) and yet its image may 
have corners (where ¥ necessarily vanishes). Note that a curve with corners 
can never minimize the distance, even locally. 


Exercise 4.4.12. Show that conditions (ii) and (iii) in Exercise 4.4.11 are 
equivalent, even without the assumption that 7 is nowhere vanishing. De- 
duce that, if y: J + M is a shortest curve joining p to q, i.e. L(y) = d(p,q), 
then y is a reparametrized geodesic. 

Show by example that one can have a variation {7;},cr of a reparame- 
trized geodesic yo = y for which the map s +> L(7z) is not even differentiable 
at s =0. (Hint: Take y to be constant. See also Exercise 4.1.9.) 

Show, however, that conditions (i), (ii) and (iii) in Exercise 4.4.11 remain 
equivalent if the hypothesis that 7 is nowhere vanishing is weakened to the 
hypothesis that 7+(t) 4 0 for all but finitely many t € J. Conclude that a bro- 
ken geodesic is a reparametrized geodesic if and only if it minimizes arclength 
locally. (A broken geodesic is a continuous map y: J = [a,b] > M for 
which there exist a = to < t] < +++ <t, =b such that 7|,_, 4,) is a geodesic 
fori =1,...,n. It is thus a geodesic if and only if 7 is continuous at the 
break points, ie. ¥(t; ) = 7(¢/) fori=1,...,n—1.) 
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4.5 Convex Neighborhoods 


A subset of an affine space is called convex iff it contains the line segment 
joining any two of its points. The definition carries over to a submanifold M 
of Euclidean space (or indeed more generally to any manifold M equipped 
with a spray) once we reword the definition so as to confront the difficulty 
that a geodesic joining two points might not exist nor, if it does, need it be 
unique. 


Definition 4.5.1 (Geodesically convex set). Let M CR” be a smooth 
m-dimensional manifold. A subset U C M is called geodesically con- 
vex iff, for all po,pi € U, there exists a unique geodesic y : [0,1] > U such 
that 1(0) = po and y(1) = pr. 


It is not precluded in Definition 4.5.1 that there be other geodesics from p 
to q which leave and then re-enter U, and these may even be shorter than 
the geodesic in U. 


Exercise 4.5.2. (a) Find a geodesically convex set U in a manifold M and 
points po, pi € U such that the unique geodesic ¥ : [0,1] + U with 7(0) = po 
and y(1) =i has length L(y) > d(po,pi). Hint: An interval of length 
bigger than 7 in S*. 


(b) Find a set U in a manifold M such that any two points in U can be 
joined by a minimal geodesic in U, but U is not geodesically convex. Hint: 
A closed hemisphere in S?. 


Theorem 4.5.3 (Convex Neighborhood Theorem). Let M Cc R” be a 
smooth m-dimensional submanifold and fix a point pp € M. Let 6: U —Q 
be any coordinate chart on an open neighborhood U C M of po with values 
in an open set Q C R™. Then the set 


Ur := {p €U| |b(p) — b(po)| <r} (4.5.1) 


is geodesically convex for r > 0 sufficiently small. 


Proof. See page 206. 


Corollary 4.5.4. Let MCR” be a smooth m-manifold and let pp © M. 
Then, forr > 0 sufficiently small, the open ball 


U,(po) = {p € M | d(po, p) < r} (4.5.2) 


is geodesically convex. 
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Proof. Choose an orthonormal basis e1,...,€m Of T>,M and define 


Q := {x € R™| |x| < inj(po; M)}, 
U := {p € M|d(po, p) < inj(po; M)}. 
Define the map w :Q— U by 


W(x) := exp,, ( vs) (4.5.4) 
i=1 


for x = (z!,...,2™) €Q. Then w is a diffeomorphism and d(po, w(x)) = |a| 
for all c € Q by Theorem 4.4.4. Hence its inverse 


(4.5.3) 


gay: sa (4.5.5) 
satisfies (po) = 0 and |¢(p)| = d(po, p) for all p € U. Thus 


Ur(po) = {p € U| |e(p) — P(po)| <r} for 0 < r < inj(po; M) 


and so Corollary 4.5.4 follows from Theorem 4.5.3. 


Definition 4.5.5 (Geodesically normal coordinates). The coordinate 
chart 6: U + Q in (4.5.4) and (4.5.5) sends geodesics through po to straight 


lines through the origin. Its components x!,...,2’™ : U — R are called geo- 
desically normal coordinates at po. 


Proof of Theorem 4.5.8. Assume without loss of generality that ¢(po) = 0. 
Let rk : Q — R be the Christoffel symbols of the coordinate chart and, 
for x € Q, define the quadratic function Qz : R™ > R by 


a6) =o () — YO athe’. 


k=1 i,j,k=1 


Shrinking U, if necessary, we may assume that 


1 


for all x EQ. 
2m 


Then, for all 2 € 2 and all € € R™ we have 


Wi 2 
Qz(€) > 12 - = (3 > SleP>0. 


2m \¢4 
ll 


Hence Q,. is positive definite for every x € (2. 
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Now let 7 : [0,1] + U be a geodesic and define 


c(t) == o(y(t)) 


for0 <t<1. Then, by Lemma 4.3.14, c satisfies the differential equation 


Hence ae 
; F 2 : c 
“5 = =60 = +eo=0.102 a >0 
and so the function t ++ |¢(y(t))|* is convex. Thus, if y(0),y(1) € U; for 
some r > 0, it follows that y(t) € U, for all t € [0, 1]. 
Consider the exponential map 


V={(p,v) €TM |v € Vp} > M: (p,v) + exp,(v) 


in Lemma 4.3.6. Its domain V is open and the exponential map is smooth. 
Since it sends the pair (po,0) € V to exp,,(0) = po € U, it follows from con- 
tinuity that there exist constants « > 0 and r > 0 such that 


peu,, ve TpM, |v| <eé = vEVp, exp,(v) €U. (4.5.6) 
Moreover, we have 
dexp,, (0) = id : Tp, M — Tp,M 


by Corollary 4.3.7. Hence the Implicit Function Theorem 2.6.15 asserts that 
the constants « > 0 and r > 0 can be chosen such that (4.5.6) holds and there 
exists a smooth map h: U, x U; > R” that satisfies the conditions 


h(p,q)€TpM, — |A(p, ag) <e (4.5.7) 
for all p,g € U, and 
exp,(v) = q = v = h(p,q) (4.5.8) 
for all p,q € U, and all v € T,M with |v| < e. In particular, we have 
h(po, Po) = 9 


and exp,(h(p, q)) = q for all p,q € Uy. 
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Fix two constants € > 0 and r > 0 and a smooth map h: U, x U, — R" 
such that (4.5.6), (4.5.7), (4.5.8) are satisfied. We show that any two 
points p,q € U, are joined by a geodesic in U,. Let p,q € U;, and define 


y(t) == exp,(th(p, q)) for OS 7 1. 
This curve y : [0,1] — M is well defined by (4.5.6) and (4.5.7), it is a 


geodesic satisfying 7(0) = p € U, by Lemma 4.3.6, it satisfies y(1) = q € U; 
by (4.5.8), it takes values in U by (4.5.6) and (4.5.7), and so 7({0, 1]) CU, 
because the function [0,1] + R: t+ |4(7(t))|? is convex. 

We show that there exists at most one geodesic in U, joining p and q. 
Let p,q €U, and let y: [0,1] > U, be any geodesic such that y(0) =p 
and (1) =q. Define v := 4(0) € T,M. Then 7(t) = exp,(tv) for0 <t <1 
by Lemma 4.3.6. We claim that |v| < ¢. Suppose, by contradiction, that 


|u| >. 


Then 


i 24 
|0| 


and, for 0 < t < T, we have |tu| < € and exp, (tv) = y(t) € U; and so 
h(p, y(t)) = tw. 
by (4.5.8). Thus 


|h(p, y(€))| = tlv| for0<t<T. 
Take the limit ¢ 7 T to obtain 
|n(p, o(L))| = T |v] = € 


in contradiction to (4.5.7). This contradiction shows that |v| < e. Since 


exp,(v) = y(1) =q € U, 
it follows from (4.5.8) that v = h(p,q). This proves Theorem 4.5.3. 


Remark 4.5.6. Theorem 4.5.3 and its proof carry over to general sprays 
(see Definition 4.3.2). 


Exercise 4.5.7. Consider the set U,(p) = {¢ € M|d(p,q) <r} for pe M 
and r > 0. Corollary 4.5.4 asserts that this set is geodesically convex for r 
sufficiently small. How large can you choose r in the cases 


Meo, MeT=a=Sxs) MSR Wek ior 


Compare this with the injectivity radius. If the set U,(p) in these exam- 
ples is geodesically convex, does it follow that every geodesic in U,(p) is 
minimizing? 
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4.6 Completeness and Hopf—Rinow 


For a Riemannian manifold there are different notions of completeness. 
First, in §3.4 completeness was defined in terms of the completeness of time 
dependent basic vector fields on the frame bundle (Definition 3.4.10). Sec- 
ond, there is a distance function 


d: Mx M = (0,00) 


defined by equation (4.2.2) so that we can speak of completeness of the 
metric space (M,d) in the sense that every Cauchy sequence converges. 
Third, there is the question of whether geodesics through any point in any 
direction exist for all time; if so we call a Riemannian manifold geodesically 
complete. The remarkable fact is that these three rather different notions of 
completeness are actually equivalent and that, in the complete case, any two 
points in M can be joined by a shortest geodesic. This is the content of the 
Hopf-Rinow theorem. We will spell out the details of the proof for embedded 
manifolds and leave it to the reader (as a straight forward exercise) to extend 
the proof to the intrinsic setting. 


Geodesic Completeness 


Definition 4.6.1 (Geodesically complete manifold). Let M C R” be an 
m-dimensional manifold. Given a point p € M we say that M is geodesi- 
cally complete at p if, for every tangent vector v € T,M, there exists a 
geodesic y : R > M (on the entire real axis) satisfying y(0) = p and4+(0) =v 
(or equivalently Vp = T,M where V, C T,M is defined by (4.3.5)). The man- 
ifold M is called geodesically complete iff it is geodesically complete at 
every point pe M. 


Definition 4.6.2. Let (M,d) be a metric space. A subset AC M is called 
bounded iff 


sup d(p, po) < co 
pea 


for some (and hence every) point po © M. 
Example 4.6.3. A manifold M C R"” can be contained in a bounded subset 


of R” and still not be bounded with respect to the metric (4.2.2). An 
example is the 1-manifold M = {(a,y) € R?|0<a<1,y=sin(1/z)}. 


Exercise 4.6.4. Let (/,d) be a metric space. Prove that every compact 
subset K C M is closed and bounded. Find an example of a metric space 
that contains a closed and bounded subset that is not compact. 
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Theorem 4.6.5 (Completeness). Let M Cc R" be a connected m-dimen- 
sional manifold and let d: M x M — (0,00) be the distance function defined 
by (4.1.1), (4.2.1), and (4.2.2). Then the following are equivalent. 

(i) M is geodesically complete. 

(ii) There exists a point p € M such that M is geodesically complete at p. 
(iii) Every closed and bounded subset of M is compact. 

(iv) (M,d) is a complete metric space. 

(v) M is complete, i.e. for every smooth curve €: R— R™ and every ele- 
ment (po,eo) € F(M) there exists a smooth curve 8B: R— F(M) satisfying 


B(t) = Be (8), B(0) = (Po, 0). (4.6.1) 
(vi) The basic vector field Be € Vect(F(M)) is complete for every € € R™. 


(vii) For every smooth curve y/:R—R™, every po € M, and every or- 
thogonal isomorphism ®o : Tp, M — R™ there exists a development (®, 7,7’) 
of M along R™ on all of R that satisfies y(0) = po and ®(0) = Go. 


Proof. The proof relies on Theorem 4.6.6 below. 


Global Existence of Minimal Geodesics 


Theorem 4.6.6 (Hopf—Rinow). Let M C R” be a connected m-manifold 
and let p€ M. Assume M is geodesically complete at p. Then, for ev- 
ery q € M, there exists a geodesic y : [0,1] + M such that 


7(0) =p, (1) = 4, L(y) = d(p,q). 


Proof. See page 215. 


Theorem 4.6.6 implies Theorem 4.6.5. That (i) implies (ii) follows directly 
from the definitions. 

We prove that (ii) implies (iii). Thus assume that M is geodesically 
complete at the point po € M and let K C M be a closed and bounded 
subset. Then r := supgex d(po,q) < 00. Hence Theorem 4.6.6 asserts that, 
for every g € K, there exists a vector v € T,,M such that |v] = d(po,q) <r 
and exp,,(v) = q. Thus 


K Cexp,,(Br(po)), Br(po) = {uv € Tp M | |v] <r}. 


Then B := {v € Ty,M | |v| <r, exp,,(v) € A} is a closed and bounded sub- 
set of the Euclidean space T,,M. Hence B is compact and K = exp,,(B). 
Since the exponential map exp, : Tp) — M is continuous it follows that K 
is compact. This shows that (ii) implies (iii). 
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We prove that (iii) implies (iv). Thus assume that every closed and 
bounded subset of M is compact and choose a Cauchy sequence pj € M. 
Choose ig € N such that d(p;,p;) < 1 for all i,j € N with i,7 > ip. Define 

> mes apiaPs) + 1s 
Then d(p1, pi) < d(pi, Pio) + A(Pig, Pi) < d(p1, Pig) +1 < c for all 7 > io and 
so d(p1, pi) < c for alli € N. Hence the set {p;|i € N} is bounded and so 
is its closure. By (iii) this implies that the sequence p; has a convergent 
subsequence. Since p; is a Cauchy sequence, this implies that p; converges. 
Thus we have proved that (iii) implies (iv). 

We prove that (iv) implies (v). Fix a smooth curve €: R > R™” and 
an element (po,e0) € F(M). Assume, by contradiction, that there exists 
a real number T > 0 such that there exists a solution 6 : [0,T) —~ F(M) 
of equation (4.6.1) that cannot be extended to the interval [0,7 +) for 
any € > 0. Write 6(t) =: (y(t), e(t)) so that y and e satisfy the equations 

V(t) = e(t)&(t),  e(t) = hy (F(t)e(t), (0) = po, e(0) = eo. 
This implies e(t)n € Tyq)M and é(t)n € TriyM for all 7 € R™ and therefore 

d d . : 

FZ lmelt) e(t¢) = a (elt), e(6)0) = (e()n, e(4)6) + (elt), e(t)¢) = 0 


for all n,¢ € R™ and all t € [0,7). Thus the function t +> e(t)e(t) is con- 
stant, hence 


e(é)Te(t) = efeo, —fe(#)||] = sup PAL = yay (4.6.2) 
0 R™ In| 


for 0 <t< T, hence 
l4(t)] = le@)E()| < leoll 1€(4)| < lleol] sup_|é(s)| =: er 
0<s<T 


and so d(7(s), y(t)) < Lis.) < (@- s)er forO<s<t<T. Since (M,d) 
is a complete metric space, this shows that the limit p; := lim, zr y(t) © M 
exists. Thus the set K := 7({0,7')) U{pi} C M is compact and so is the set 


K:= { (p,e) €F(M)|pe K, eTe = egeo} C F(M). 


By equation (4.6.2) the curve [0,7) > R x F(M): t+ (t,7(t), e(t)) takes 
values in the compact set [0,7] x K and is the integral curve of a vector field 
on the manifold R x F(M). Hence Corollary 2.4.15 asserts that [0,7) cannot 
be the maximal existence interval of this integral curve, a contradiction. This 
shows that (iv) implies (v). 
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That (v) implies (vi) follows by taking €(t) = € in (v). 

We prove that (vi) implies (i). Fix an element po € M and a tan- 
gent vector vo € Tp,M. Let e9 € Liso(R™, Tp)M) be any isomorphism and 
choose € € R™ such that e9€ = vg. By (vi) the vector field Be has a unique 
integral curve R > F(M):t G(t) = (y(t), e(t)) with 


B(0) = (po, €0). 
Thus 


and hence 


Y(t) = E(t)E = hye E)e(t)E = hy wy F@), YQ). 

By the Gauf—Weingarten formula, this implies V7¥(t) = 0 for every t and 

hence 7 : R > M is a geodesic with 7(0) = po and ¥(0) = eo€ = vp. Thus M 
is geodesically complete and this shows that (vi) implies (i). 

The equivalence of (v) and (vii) was established in Corollary 3.5.25 and 

this shows that Theorem 4.6.6 implies Theorem 4.6.5. 


Proof of the Hopf-Rinow Theorem 
The proof of Theorem 4.6.6 relies on the next two lemmas. 


Lemma 4.6.7. Let M Cc R” be a connected m-manifold and p © M. Sup- 
pose € > 0 is smaller than the injectivity radius of M at p and denote 


X1(p) :={vET)M||v|=1}, S-(p) := {p' € M|d(p,p’) =e}. 


Then the map %i(p) + S.(p) : v + exp,(ev) is a diffeomorphism and, for 
allqg € M, we have 


d(p,q) > € — d(Se(p),q) = d(p, 4) — €. 
Proof. By Theorem 4.4.4, we have 
d(p, exp,(v)) = |v| for all v € T,M with |v| <e 
and 
d(p,p') >€ for all p' € M \ {exp,(v)|v € TpM, |v| <e}. 


This shows that S-(p) = exp,(€X1(p)) and, since ¢ is smaller than the injec- 
tivity radius, the map 


X1i(p) + Se(p) : v +> exp,(ev) 


is a diffeomorphism. 
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To prove the second assertion, let g € M such that 
Pi=d(p.g) Se 
Fix a constant 6 > 0 and choose a smooth curve ¥ : [0,1] > M such that 
VWO)=~p, W=aq LO)<rt+s 
Choose tp > 0 such that y(to) is the last point of the curve on S-(p), i.e. 


y(to) € Se(p), Y(t) € Se(p) for to <t <1. 


Then 
d(y(to), 4) < L(V |t,1)) 
= L(y) — L(V )0,t0)) 
ai) —s 
<r+6é-e 


This shows that d(S-(p),q) <7 +6 -—e for every 6 > 0 and therefore 


d(S-(p), q) < r—€. 
Moreover, 
d(p',q) = d(p,q) — d(p,p') =r —e 


for all p’ € S-(p). Thus 
d(S.(p), q) =r—e 


and this proves Lemma 4.6.7. 


Lemma 4.6.8 (Curve Shortening Lemma). Let M Cc R” be an m-mani- 
fold, let p € M, and let € be a real number such that 


0<eé< inj(p;M). 
Then, for allv,w € TpM, we have 


|v] =|w| =e, d(exp,(v), exp,(w)) = 2e = v+w=0. 
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Figure 4.7: Two unit tangent vectors. 


Proof. We will prove that, for all v,w € T,M, we have 


d Ov), 0) 
tiny Pa ere) lv — w]. (4.6.3) 
60 6 


Assume this holds and suppose, by contradiction, that there exist two tan- 
gent vectors v,w € T,M such that 


fe] = |e] =, d(exp, (ev), exp, (ew)) = 2e, v+w #0. 


Then 
ju-—w| <2 
(see Figure 4.7). Thus by (4.6.3) there exists a constant 0 < 6 < € such that 


d(exp, (dv), exp,(dw)) < 26. 


Then 
d(exp,(€v), exp,(ew)) 
< d(exp,(ev), exp, (dv)) + d(exp, (dv), exp,(dw)) + d(exp, (dw), exp, (ew)) 
<e—0+26+e€—-—6=2¢ 


and this contradicts our assumption. 
It remains to prove (4.6.3). For this we observe that 


_ d(exp, (dv), exp, (dw)) 


6-0 ) 

ies d(exp, (dv), exp,(dw)) exp, (dv) — exp, (dw)| 
50 |exp, (dv) - exp, (dw)| 6 

Waa |exp, (dv) — exp, (dw)| 
6-0 ) 

tie, exp,(6v)—p  exp,(dw) —p 
6-0 6 0) 

=|v-—w| 


Here the second equality follows from Lemma, 4.2.7. 
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Proof of Theorem 4.6.6. By assumption M C R” is a connected submani- 
fold, and p € M is given such that the exponential map exp, : T,M — M is 
defined on the entire tangent space at p. Fix a point qg € M \ {p} so that 


0< Ff =p, 9) < co. 


Choose a constant ¢ > 0 smaller than the injectivity radius of M at p and 
smaller than r. Then, by Lemma 4.6.7, we have 


d(S-(p),q) =r —€. 
Hence there exists a tangent vector v € T,M such that 
d(exp,(ev),q) =r —€, lo] = 1. 
Define the curve y: [0,r] + M by 
7(t) := exp,(tv) forO<t<r. 


By Lemma 4.3.6, this is a geodesic and it satisfies y(0) =p. We must 
prove that 7(r) =q and L(y) =d(p,q). Instead we will prove the follow- 
ing stronger statement. 


Claim. For every t € [0,r] we have 
d(y(t),q) =r —t. 


In particular, y(r) = q and L(y) =r = d(p,q). 


Consider the subset 
I :={t € [0,r] |d(yv,q) =r —t} c [0,7]. 
This set is nonempty, because « € J, it is obviously closed, and 
tel = [0,#] CTI. (4.6.4) 


Namely, if t € J andO<s <t, then 


d(y(s),q) < d(y(s), y(t)) + d(v(t),q) <t-s+r—-t=r-s 
and 
d(y(s),q) 2 d(p, 4) — d(p, 7(s)) 27 — 8. 
Hence d(7(s),q) =r —s and hence s € I. This proves (4.6.4). 
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We prove that J is open (in the relative topology of [0,7r]). Let t € I 
be given with t < r. Choose a constant ¢ > 0 smaller than the injectivity 
radius of M at y(t) and smaller than r —t. Then, by Lemma 4.6.7 with p 
replaced by 7(t), we have a 


d(S-(y(t)),q) =r —t—e. 
Next we choose w € T);)M such that 
|w| = 1, d(exp,) (ew), q)=r—t—e. 
Then 


IV 


d(y(t — €),q) — dlexpyx (ew), 9) 
(r —t+e)—(r—-t—e) 
2e. 


d(y(t — €), expy(ty(ew)) 


The converse inequality is obvious, because both points have distance € 
to y(t) (see Figure 4.8). 


Se(¥() EXP e Ww) 


Figure 4.8: The proof of the Hopf—Rinow theorem. 


Thus we have proved that 
d(y(t = =), €XP+(t) (ew)) = 2e. 
Since 
y(t — €) = expyzy(—e¥(4)), 
it follows from Lemma 4.6.8 that 
A = yh), 
Hence exp,)(sw) = y(t + s) and this implies that 
d(y(t+¢),q)=r—t-—e. 
Thus t+ € J and, by (4.6.4), we have [0,t +] € J. Thus we have proved 


that I is open. In other words, I is a nonempty subset of [0,7] which is 
both open and closed, and hence J = [0,r]. This proves the claim and 


Theorem 4.6.6. 
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4.7 Geodesics in the Intrinsic Setting* 


4.7.1 Intrinsic Distance 


Let M be a connected smooth manifold (§2.8) equipped with a Riemannian 
metric (§3.7). Then we can define the length of a curve 7 : [0,1] ~ M 
by the formula (4.1.1) and it is invariant under reparametrization as in 
Remark 4.1.3. The distance function d: Mx M —R is then given by 
the same formula (4.2.2). We prove that it still defines a metric on M and 


that this metric induces the same topology as the smooth structure. 


Lemma 4.7.1. Let M be a connected smooth Riemannian manifold and 
define the function d: M x M — (0,00) by (4.1.1), (4.2.1), and (4.2.2). 
Then d is a metric and induces the same topology as the smooth structure. 


Proof. The proof has three steps. 


Step 1. Fiz a point po € M and let 6: U > Q be a coordinate chart of M 
onto an open subset Q C R™ such that pp € U. Then there exists an open 
neighborhood V CU of po and constants 6,r > 0 such that 


5|6(p) — o(po)| < d(p, po) < 6" |o(p) — 6(po)| (4.7.1) 
for every p€ V and d(p,po) > or for everypE M\V. 


Denote the inverse of the coordinate chart ¢ by | := ¢@-!:2— M and 
define the map g = (gi), : QQ R™*<™ by gij(z) = (22 (x) 2 (x) u(x) 


ij=l dat \©)s Bai 
for « € Q. Then a smooth curve ¥: [0,1] > U has the length 
1 
Lo) =f feeTaemetar, el = o(r(0). (4.7.2) 


Let 29 := 6(po) € 2 and choose r > 0 such that B,(xo) C Q. Then there is 
a constant 6 € (0, 1] such that 


SIE] S VET G(aE SO" [| (4.7.3) 


for all x € B,(ao) and €,n € R™. Define V := ¢-1(B,(a9)) CU. 

Now let p € V and denote x := ¢(p) € B,(ao). Then, for every smooth 
curve 7: [0,1] > V with (0) = po and (1) = p, the curve c := ¢0 ¥ takes 
values in B,(xo) and satisfies c(0) = xp and c(1) = x. Hence, by (4.7.2) 
and (4.7.3), we have a 


Lo) 26 [to at >| f ateyat aii, 
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If y : [0,1] — M is a smooth curve with endpoints y(0) = po and y(1) = p 
whose image is not entirely contained in V, then there exists a T € (0, 1] 
such that y(t) € V for0 <t < T and y(T) € OV, so c(t) = o(7(t)) € B, (xo) 
for 0 <t < T and |c(T’) — xo| = r. Hence, by the above argument, we have 


L(y) = or. 


This shows that d(po,p) > 6r for p€ M\ V and d(po, p) > 6 |¢(p) — b(po)| 
forpeV. IfpeV, x := g(p), and c(t) := rp + t(x— 2), then y := wocisa 
smooth curve in V with y(0) = po and y(1) = p and, by (4.7.2) and (4.7.3), 


Af! 
L(y) <6 | la(t)| dt = 6-1 Ja — ao). 
0 


This proves Step 1. 
Step 2. d is a distance function. 


Step 1 shows that d(p, po) > 0 for every p € M \ {po} and hence d satisfies 
condition (i) in Lemma 4.2.3. The proofs of (ii) and (iii) remain unchanged 
in the intrinsic setting and this proves Step 2. 


Step 3. The topology on M induced by d agrees with the topology induced 
by the smooth structure. 


Assume first that WC M is open with respect to the manifold topology 
and let p) € W. Let 6: U > be a coordinate chart of M onto an open 
subset 2 Cc R™ such that po € U, and choose V C U and 6,r as in Step 1. 
Then ¢(V MW) is an open subset of 2 containing the point @(po). Hence 
there exists a constant 0 < « < ér such that Bs-1-(¢(po)) C 6(V NW). 
Thus by Step 1 we have d(p, po) > dr > € for allp € M\V. Hence, if p € M 
satisfies d(p,po) < ¢€, then p€V, so |6(p) — (po)| < 6-'d(p, po) < d-1e 
by (4.7.1), and therefore ¢(p) € ¢(VNW). Thus B.(po;d) C W and this 
shows that W is open with respect to d. 

Conversely, assume that W C M be open with respect to d and choose 
a coordinate chart 6: U > Q onto an open set 2 Cc R™. We must prove 
that ¢(U NW) is an open subset of 2. To see this, choose ro € d(UNW) 
and let po := ¢@ (a0) € UN W. Now choose V C U and 6,r as in Step 1. 
Choose ¢ > 0 such that Bs-1-(po;d) C W and B.(x9) C o(V). Let 2 € R” 
such that |x — xo| < e. Then x € ¢(V) and therefore p := ¢-1(x) € V. This 
implies d(p, po) < 6—+|¢(p) — ¢(po)| = 6+ |x — 20| < 5-+e, thus pe WNU, 
and so x = $(p) € 6(W NU). Thus ¢(W NU) is an open, and so W is open 
in the manifold topology of M. This proves Step 3 and Lemma 4.7.1. 
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4.7.2 Geodesics and the Levi-Civita Connection 


With the covariant derivative understood (Theorem 3.7.8), we can define 
geodesics on M as smooth curves 7: I > M that satisfy the equation 


Vr=4, 


as in Definition 4.1.5. Then all the above results about geodesics, as well 
as their proofs, carry over almost verbatim to the intrinsic setting. In 
particular, geodesics are in local coordinates described by equation (4.3.6) 


(Lemma 4.3.14) and they are the critical points of the energy functional 


1 
BO) = 5 f oe ae 


on the space Qp¢ of all paths y : [0,1] + M with fixed endpoints y(0) = p 
and 7(1) =q. Here we use the fact that Lemma 4.1.7 extends to the in- 
trinsic setting via the Embedding Theorem 2.9.12. So for every vector 
field X € Vect(y) along y with X(0) = 0 and X(1) = 0 there exists a curve 
of curves R> Qp¢:5'4 Ys with yo = y and Os7s5|s-0 = X. Then, by the 


properties of the Levi-Civita connection, we have 


1 1 
dE(y)X So Os |r7s(t)|? dt 
0 


II II 
| om, 

oy a6 
ee 2. 
a e 
= <A 
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Es 

Q 

oe 


The right hand side vanishes for all X if and only if Vy = 0 (Theorem 4.1.4). 
With this understood, we find that, for allp ¢ M andv € T,M, there exists 
a unique geodesic y: Ip, — M on a maximal open interval J,,, C R con- 
taining zero that satisfies 7(0) = p and 4(0) = v (Lemma 4.3.4). This gives 
rise to a smooth exponential map exp, : Vp = {v € TpM |1€ Ipv} + M as 
in §4.3 which satisfies dexp,(0) = id : T,M — T,M as in Corollary 4.3.7. 
This leads directly to the injectivity radius, the Gau8 Lemma 4.4.5, the local 
length minimizing property of geodesics in Theorem 4.4.4, and the Convex 
Neighborhood Theorem 4.5.3. Also the proof of the equivalence of metric 
and geodesic completeness in Theorem 4.6.5 and of the Hopf—Rinow Theo- 
rem 4.6.6 carry over verbatim to the intrinsic setting of general Riemannian 
manifolds. The only place where some care must be taken is in the proof of 
the Curve Shortening Lemma 4.6.8 as is spelled out in Exercise 4.7.2 below. 
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4.7.3. Examples and Exercises 


Exercise 4.7.2. Choose a coordinate chart ¢: U > 2 with ¢(po) = 0 such 
that the metric in local coordinates satisfies 
9ij(0) = 945. 
Refine the estimate (4.7.1) in the proof of Lemma 4.7.1 and show that 
p.arpo |(p) — O(9)| 
This is the intrinsic analogue of Lemma 4.2.8. Use this to prove that equa- 


tion (4.6.3) continues to hold for all Riemannian manifolds, i.e. 


d(exp, (dv), exp, (dw)) 
lim =| | 
0 6 


for p€ M and v,w € T,M. With this understood, the proof of the Curve 
Shortening Lemma 4.6.8 carries over verbatim to the intrinsic setting. 


Exercise 4.7.3. The real projective space RP” inherits a Riemannian met- 
ric from S” as it is a quotient of S” by an isometric involution. Prove that 
each geodesic in S” with its standard metric descends to a geodesic in RP”. 


Exercise 4.7.4. Let f : S? + S? be the Hopf fibration defined by 
f(z,w) = (is — jw? , 2Re Zw, 21m zw) 


Prove that the image of a great circle in S? is a nonconstant geodesic in S? 
if and only if it is orthogonal to the fibers of f, which are also great circles. 
Here we identify S° with the unit sphere in C?. (See also Exercise 2.5.21.) 


Exercise 4.7.5. Prove that a nonconstant geodesic y : R > S?"+! de- 
scends to a nonconstant geodesic in CP” with the Fubini-Study metric (see 
Example 3.7.5) if and only if (t) L Cy(t) for every t € R. 


Exercise 4.7.6. Consider the manifold 

F(R") = {D eR™*| DTD = 1} 
of orthonormal k-frames in R”, equipped with the Riemannian metric in- 
herited from the standard inner product 


(X,Y) := trace(X TY) 


on the space of real n x k-matrices. 
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(a) Prove that 
Tp F(R") = {x eR" |p'xX4x"D= o} 
TpF,(R")+ = {DA |A=ATe Re} 
and that the orthogonal projection H(D) : R"** + TpF;(R”) is given by 
I(D)X =X — 5D(DTX + XD). 
(b) Prove that the second fundamental form of F;,(IR”) is given by 
hp(X)Y = -5D(xTY +Y'X) 


for D € F(R") and X,Y € TpF;(R”). 
(c) Prove that a smooth map R > F;(R") : t 4 D(#) is a geodesic if and 
only if it satisfies the differential equation 


D=-DD'D. (4.7.4) 


Prove that the function D™ D is constant for every geodesic in F(R”). Com- 
pare this with Example 4.3.12. 


Exercise 4.7.7. Let G;(R”) = F;,(R")/O(k) be the real Grassmannian of 
k-dimensional subspaces in R”, equipped with a Riemannian metric as in 
Example 3.7.6. Prove that a geodesics R > F;(R") : t + D(t) descends 
to a nonconstant geodesic in G,(R”) if and only if D'D = 0 and D £0. 
Deduce that the exponential map on G;(R”) is given by 


exp, (A) =im (p cos (("8) *) +2 (DD) ie sin (278) )) 


for A € F(R") and A € T)F;,(R”) \ {0}. Here we identify the tangent 
space T,7;(R") with the space of linear maps from A to A+, and choose 
the matrices D € F(R”) and D € R™** such that 


A=imD, D'D=0, AoD=D:R*+At=kerD". 


Prove that the group O(n) acts on G;(IR") by isometries. Which subgroup 
acts trivially? 


Exercise 4.7.8. Carry over Exercises 4.7.6 and 4.7.7 to the complex Grass- 
mannian G;(C”). Prove that the group U(n) acts on G;(C”) by isometries. 
Which subgroup acts trivially? 
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Chapter 5 


Curvature 


This chapter begins by introducing the notion of an isometry (85.1). It 
shows that isometries of embedded manifolds preserve the lengths of curves 
and can be characterized as diffeomorphisms whose derivatives preserve the 
inner products. The chapter then moves on to the Riemann curvature ten- 
sor (§5.2). The next section is devoted to the generalized Gau8 Theorema 
Egregium which asserts that isometries preserve geodesics, the covariant 
derivative, and the Riemann curvature tensor (§5.3). The final section ex- 
amines the Riemann curvature tensor in local coordinates and shows how 
the definitions and results of the present chapter carry over to the intrinsic 


setting of Riemannian manifolds (§5.4). 


5.1 Isometries 


Characterization of Isometries 


Let M and M’ be connected submanifolds of R”. An isometry is an isomor- 
phism of the intrinsic geometries of M and M’. Recall the definition of the 
intrinsic distance function 


d: Mx M = (0,00) 
in 84.2 by 


1 
d(p,q):= inf L(y), Ly) = / I4(t)| at 


YEN, g 


for p,q € M. Let d’ denote the intrinisic distance function on M’. 
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Theorem 5.1.1 (Isometries). Let 6: M — M’ be a bijective map. Then 
the following are equivalent. 


(i) ¢ intertwines the distance functions on M and M", i.e. 


d'((p), 6(@)) = d(p, @) 


for allp,qeE M. 
(ii) ¢ is a diffeomorphism and 


dd(p) i TpM => Tp) M' 


is an orthogonal isomorphism for every p € M. 


(iii) @ is a diffeomorphism and 


L(go y) = L(y) 
for every smooth curve 7: [a,b] > M. 


The bijection @ is called an isometry iff it satisfies these equivalent condi- 
tions. In the case M = M' the isometries 6: M — M form a group denoted 
by Z(M) and called the isometry group of M. 


Proof. See page 226. 


Lemma 5.1.2. For every p € M there exists a constant ¢ > 0 such that, 
for allv,w€T,M with 0 < |w| < |v| < e, we have 


d(exp,(w), exp,(v)) = |v] — |w| == w= —v. (sale) 


Proof. See page 225. 


Remark 5.1.3. It follows from the triangle inequality and Theorem 4.4.4 
that ae 


d(expp(v), exp,(w)) = d(exp,(v), p) — dlexp,(w), p) = |v] — [w| 


whenever 0 < |w| < |v| < inj(p). Lemma 5.1.2 asserts that equality can 
only hold when w is a positive multiple of v or, to put it differently, that the 
distance between exp,(v) and exp,(w) must be strictly bigger that |v| — |w| 
whenever w is not a positive multiple of v. 
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Proof of Lemma 5.1.2. As in Corollary 4.3.8 we denote 


B-(p) := {vu € T>M | |v| < €}, 
Ue(p) = {¢ € M|d(p,q) < e}. 


By Theorem 4.4.4 and the definition of the injectivity radius, the exponential 
map at p is a diffeomorphism exp, : B.(p) + U-(p) for ¢ < inj(p). Choose 
0 <r <inj(p). Then the closure of U,(p) is a compact subset of M. Hence 
there is a constant ¢ > 0 such that ¢ < r and e€ < inj(p’) for every p’ € U;(p). 
Since € < r we have 


é<inj(p’) Vp’ €U-(p). (5.1.2) 


Thus exp, : B.(p') — Uz(p’) is a diffeomorphism for every p’ € U-.(p). 
Define p1 := exp,(w) and p2 := exp,(v). Then, by assumption, we have 
d(pi1, p2) = |v|—|w| < e. Since p; € U-(p) it follows from our choice of € that 
€ < inj(p;). Hence there is a unique tangent vector vj € Tp, M such that 


[v1] = d(p1, p2) = |v| — |v, &XD,, (%1) = p2.- 


Following first the shortest geodesic from p to p, and then the shortest 
geodesic from p; to pp we obtain (after suitable reparametrization) a smooth 
y: [0,2] + M such that 


VO=p, yA)=pi, (2) =pa, 
and 
L(Y\to,1)) = d(p,p1) = |v, L(y 1,2)) = d(pi, p2) = |v| — |v]. 


Thus L(y) = |v| = d(p,p2). Hence, by Theorem 4.4.4, there is a smooth 
function £ : [0,2] — [0,1] satisfying 


B(0)=0, 6(2)=1, BE) >0, Y(t) = exp, (8(E)v) 
for every t € [0,2]. This implies 
exp, (tw) = pr = (1) =exp,(B(1)v), 0 (4) <1. 


Since w and 6(1)v are both elements of B.(p) and exp, is injective on B.(p), 
this implies w = B(1)v. Since 6(1) > 0 we have G(1) = |w|/|v|. This 
proves (5.1.1) and Lemma 5.1.2. 
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Proof of Theorem 5.1.1. That (ii) implies (iii) follows from the definition of 
the length of a curve. Namely 


b 
u6er) = f [Sette] a 


b 
= : dé(y(t))4(t)| at 


b 
= f ola 
a 
= L(y). 
In the third equation we have used (ii). That (iii) implies (i) follows imme- 
diately from the definition of the intrinsic distance functions d and d’. 
We prove that (i) implies (ii). Fix a point p € M and choose ¢ > 0 so 
small that ¢ < inj(p) and that the assertion of Lemma 5.1.2 holds for the 
point p’ := ¢(p) € M’. Then there is a unique homeomorphism 


®, : B.(p) = B.((p)) 


such that the following diagram commutes. 


TM >  Be(p)—SBe(b(p))  C TypM’. 
ny Jorn 
M 5 Up) + Ue(4(p)) sc M’ 


Here the vertical maps are diffeomorphisms and ¢ : U:(p) + U-(@(p)) is a 
homeomorphism by (i). Hence ©, : B-(p) + B-(¢(p)) is a homeomorphism. 


Claim 1. The map ®, satisfies the following equations for every v € B.(p) 
and every t € [0,1]: 


exPlyp)(Bp(0)) = H(exP»(0)), (5.1.3) 
,(0)| = el, (5.1.4) 
&, (tv) = t®,(v). (5.1.5) 


Equation (5.1.3) holds by definition. To prove (5.1.4) we observe that, by 
Theorem 4.4.4, we have 
|®,(v)| = d'($(p), €xP 4p) (Pp(v))) 
= d'(9(p), d(exp,(v))) 
d(p, exp,(v)) 
= |v. 
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Here the second equation follows from (5.1.3) and the third equation from (i). 
Equation (5.1.5) holds for t = 0 because ®,(0) = 0 and for t = 1 it isa 
tautology. Hence assume 0 < t < 1. Then 


 (exPyy(Gpltv)), exP'yq(Bp(v))) = d'(HLexpp(tv)), dexpp(v))) 
d(exp,(tv), exp, (v)) 

|u| — |te| 

|®,(v)| — |®p(tv)| . 


Here the first equation follows from (5.1.3), the second equation from (i), 
the third equation from Theorem 4.4.4 and the fact that |v| < inj(p), and 


the last equation follows from (5.1.4). Since 0 < |®,(tv)| < |@,(v)| < € we 
can apply Lemma 5.1.2 and obtain 


This proves Claim 1. 
By Claim 1, ®p extends to a bijective map ©, : T,M — Typ) M’ via 


(0) = 5)(50), 


where 5 > O is chosen so small that d|v| < ¢. The right hand side of 
this equation is independent of the choice of 6. Hence the extension is well 
defined. It is bijective because the original map ®, is a bijection from B-(p) 
to B-(o(p)). The reader may verify that the extended map satisfies the 
conditions (5.1.4) and (5.1.5) for all v € T,M and all t > 0. 


Claim 2. The extended map ®, :T,M — T(p)M' is linear and preserves 
the inner product. 


It follows from the equation (4.6.3) in the proof of Lemma 4.6.8 that 


oo = d(exp,(tv), exp, (tw)) 
t0 t 


d'(¢(exp,(tv)), d(exPp(tw))) 
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Here the second equation follows from (i), the third from (5.1.3), the fourth 
from (5.1.4), and the last equation follows again from (4.6. 3.3). | By polariza- 
tion we obtain 


2(v,w) = |v? + |wl? — |v — w/? 
= |®p(v)? + |®p(w) |? — [®p(v) — &p(w)/? 
2(Pp(v), Pp(w)). 


Thus ©, preserves the inner product. Hence, for all vj,v2,w € T,M, we 
have 


(®p(v1 + v2), ®p(w)) (v1 + v2, w) 
= (v1,w) + (v2, w) 
= (8,(v1), ®p(w)) + (®p(v2), &p(w)) 
= (®,(v1) + ®p(v2), ®p(w)). 


Since ®, is surjective, this implies 
®p(v1 + v2) = Pp(v1) + Op(ve) 


for all vj, v2 € T,M. With v; = v and vg = —v we obtain 


for every v € T,M and by (5.1.5) this gives 
&p(tv) = tp(v) 
for all v € T,M and t € R. This proves Claim 2. 
Claim 3. ¢ is smooth and d¢(p) = ®p. 
By (5.1.3) we have 
= ExPip) Pp O exp, : Ue(p) + Ue(G(p)). 
Since ®, is linear, this shows that the restriction of ¢ to the open set U-(p) 


is smooth. Moreover, for every v € T,M we have 


d d ; 
do(p)v = =| d(expp(tv)) = F) exp) (t8p(v)) = Bp(v). 
t=0 t=0 


Here we have used equations (5.1.3) and (5.1.5) as well as Lemma 4.3.6. 
This proves Claim 3 and Theorem 5.1.1. 
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Exercise 5.1.4. Prove that every isometry ~ : R” — R” is an affine map 


W(p) = Ap +b 


where A € O(n) and b € R”. Thus ~ is a composition of translation and 
rotation. Hint: Let €1,...,¢€, be the standard basis of R™. Prove that any 
two vectors v,w € R” that satisfy 


lol = [ew 


and 
|u — e;| = |w — e;| ford = lang 


must be equal. 


Remark 5.1.5. If ~ : R” > R” is an isometry of the ambient Euclidean 
space with w(M) = M’, then certainly ¢ := w|,y is an isometry from M 
onto M’. On the other hand, if M is a plane manifold 


M ={(0,y,z) € R?|0<y < 2/2} 
and M’ is the cylindrical manifold 
M' = {(2,y,z) € R?|2? +y? =1, x > 0, y > 0}, 
Then the map ¢: M — M’ defined by 
(0, y, 2) = (cos(y), sin(y), 2) 


is an isometry which is not of the form ¢ = w|,7. Indeed, an isometry of the 
form @ = w|,y necessarily preserves the second fundamental form (as well 
as the first) in the sense that 


dw(p)hp(v, w) = hyp (dv (p)v, db(p)w) 
for v,w € T,M but in the example h vanishes identically while h’ does not. 
We may thus distinguish two fundamental question: 


I. Given M and M’ when are they extrinsically isomorphic, i.e. when is 
there an ambient isometry w : R" > R” with w(M) = M’? 


II. Given M and M’ when are they intrinsically isomorphic, i.e. when is 
there an isometry 6: M — M’ from M onto M’? 
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As we have noted, both the first and second fundamental forms are 
preserved by extrinsic isomorphisms while only the first fundamental form 
need be preserved by an intrinsic isomorphism (i.e. an isometry). 

A question which occurred to Gau8 (who worked for a while as a cartog- 
rapher) is this: Can one draw a perfectly accurate map of a portion of the 
earth? (i.e. a map for which the distance between points on the map is pro- 
portional to the distance between the corresponding points on the surface 
of the earth). We can now pose this question as follows: Is there an isom- 
etry from an open subset of a sphere to an open subset of a plane? Gauf 
answered this question negatively by associating an invariant, the Gaufian 
curvature K : M + R, to a surface M C R®. According to his Theorema 
Egregium 

K'o¢=K 
for an isometry ¢: M — M’. The sphere has positive curvature; the plane 
has zero curvature; hence the perfectly accurate map does not exist. Our 
aim is to explain these ideas. 


Local Isometries 
We shall need a concept slightly more general than that of “isometry”. 


Definition 5.1.6 (Local isometry). A smooth map ¢: M — M' is called 
a local isometry iff its derivative 


dd(p) : TpM => Tp) M' 
is an orthogonal linear isomorphism for every p € M. 
Remark 5.1.7. Let MC R" and M’ c R™ be manifolds and ¢: M — M’ 
be a map. The following are equivalent. 
(i) ¢ is a local isometry. 
(ii) For every p € M there are open neighborhoods U Cc M and U' c M' 
such that the restriction of ¢ to U is an isometry from U onto U’. 


That (ii) implies (i) follows immediately from Theorem 5.1.1. On the other 
hand (i) implies that dé(p) is invertible so that (ii) follows from the inverse 
function theorem. 


Example 5.1.8. The map 
Ro S': 0H e% 


is a local isometry but not an isometry. 
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Exercise 5.1.9. Let M Cc R” be a compact connected 1-manifold. Prove 
that M is diffeomorphic to the circle S!. Define the length of a compact 
connected Riemannian 1-manifold. Prove that two compact connected 1- 
manifolds M,M' Cc R" are isometric if and only if they have the same 
length. Hint: Let y : R > M be a geodesic with |+(t)| = 1. Show that 
y is not injective; otherwise construct an open cover of M without finite 
subcover. If tg < t1 with y(to) = y(t1), show that 7(to) = 7(t1); otherwise 
show that y(to +t) = 7(ti —t) for all t and find a contradiction. 


The next result asserts that two local isometries that have the same value 
and the same derivative at a single point must agree everywhere, provided 
that the domain is connected. 


Lemma 5.1.10. Let M Cc R” and M’ C R™ be smooth m-manifolds and 
assume that M is connected. Let 6: M > M' andw:M > M’ be local 
isometries and let po € M such that 


(po) = (po) =: po, db(po) = deb (po) : Tyo M > Ty M". 
Then $(p) = w(p) for every pe M. 
Proof. Define the set 
Mo := {p € M| 6(p) = ¥(p), ab(p) = dv(p)}. 


This set is obviously closed. We prove that Mo is open. Let p € Mo and 
choose U Cc M and U’ Cc M’ as in Remark 5.1.7 (ii). Denote 


®, := do(p) = dp(p):T,M > TyM', — p' := o(p) = ¥(p) 


Then it follows from equation (5.1.3) in the proof of Theorem 5.1.1 that 
there exists a constant ¢ > 0 such that U-(p) C U and U.(p’) Cc U’ and 


q € U-(p) == $(q) = exply 0B, o exp, '(q) = ¥(q). 


Hence U-(p) C Mo. Thus Mp is open, closed, and nonempty. Since M is 
connected it follows that Mp = M and this proves Lemma 5.1.10. 


Exercise 5.1.11. (i) If a sequence of local isometries ¢;: M — M’ con- 
verges uniformly to a local isometry ¢: M — M"', then it converges in the 
C@™ topology. Hint: Let p € M. Then every sufficiently small tangent vec- 
tor v € T,M satisfies the equation d¢(p)v = (expiyp)) | ((exPp(v)))- Use 
this to prove that dd;(p) converges to d¢é(p). Deduce that ¢; converges to ¢ 
uniformly with all derviatives in a neighborhood of p. 

(ii) The C™ topology on the space of local isometries from M to M' agrees 
with the C° topology. 
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5.2 The Riemann Curvature Tensor 


5.2.1 Definition and Gau8-—Codazzi 


Let M C R” be a smooth manifold and y : R? + M be a smooth map. 
Denote by (s,¢) the coordinates on R?. Let Z € Vect(7) be a smooth vector 
field along 7, i.e. Z : R* > R” is a smooth map such that Z(s,t) € Ty54)M 
for all s and t. The covariant partial derivatives of Z with respect to 
the variables s and t are defined by 

OZ OZ 

DS: MZ = I(y) 5 


In particular 0.7 = O7/0s and Oyy = Oy/Ot are vector fields along 7 and we 
have 


V;Z := I(7) 


VsOry — VeOs'y = 0 


as both terms on the left are equal to II(y)0;0:y7. Thus ordinary partial 
differentiation and covariant partial differentiation commute. The analogous 
formula (which results on replacing 0 by V and ¥ by Z) is in general false. 
Instead we have the following. 


Definition 5.2.1. The Riemann curvature tensor assigns to each p © M 
the bilinear map 


Rp: T,M x T,M > L(T,M,T,M) 
characterized by the equation 
R,(u,v)w = (VsVZ — ViV-Z) (0, 0) (5.2.1) 


for u,v,w € T,M where y:R? + M is a smooth map and Z € Vect(y) is a 
smooth vector field along y such that 


(0,0) =p; O;y(0, 0) = u, Ovy(0, 0) = v, ZO, 090. (5:22) 


We must prove that R is well defined, i.e. that the right hand side of 
equation (5.2.1) is independent of the choice of y and Z. This follows from 
the Gauf—Codazzi formula which we prove next. Recall that the second fun- 
damental form can be viewed as a linear map hy : TpM + L(T,M,T,M+) 
and that, for u € T,M, the linear map h,(u) € L(T>M,T,M+) and its 
dual h,(u)* € £L(T,M+,T,M) are given by 


hyp(u)v = (dII(p)u)v, hyp(u)*w = (dII(p)u) w 
for v € T,)M and w € T,M+. 
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Theorem 5.2.2. The Riemann curvature tensor is well defined and given 
by the Gau8B—Codazzi formula 


Ry (u,v) = hp(u)*hp(v) — hp(v)*hp(w) (5.2.3) 
foru,v €T,M. 


Proof. Let u,v,w € T,M and choose a smooth map y : R? > M anda 
smooth vector field Z along y such that (5.2.2) holds. Then, by the Gauf— 
Weingarten formula (3.2.2), we have 


ViZ = OZ = ha (Ory) Z 


= OZ — (dIl(y)&7)Z 
AZ — (A(Ioy)) 2. 


Hence 


ANZ = 54% —0,((A(I7)) Z) 


7) 
= OsO,Z = (O50; (II ° 7)) Z— (0; (II fe} 7)) O,Z 
= 0542 — (Ag%(IL07)) Z — (dU()d:7) (VeZ + hy(Os7)Z) 
= 0,0,Z — (0,0;(IL 0 7)) Z — hy(O7)VeZ — hy (Opy)*hy (Oey) Z. 


Interchanging s and t and taking the difference we obtain 


OZ —OVoZ = hy(Osy)*hy(Opy)Z — hy(7)*hy(Os7)Z 
se hy(Osy)WZ _ hn (Opvy) Vs Z- 


Here the first two terms on the right are tangent to M and the last two 
terms on the right are orthogonal to T,M. Hence 


VsViZ-VVsZ = I(y)(OsViZ — 4V.Z) 
= hy(Osy)"hy(O7y)Z — hy (Ory) hy (Osy) Z. 


Evaluating the right hand side at s = t = 0 we find that 
(VsViZ — ViVsZ) (0,0) = hp(u)*Ap(v)w — Ap(v)*hp(u)w. 


This proves the Gau$8—Codazzi equation and shows that the left hand side 
is independent of the choice of y and Z. This proves Theorem 5.2.2. 


234 CHAPTER 5. CURVATURE 


5.2.2 Covariant Derivative of a Global Vector Field 


So far we have only defined the covariant derivatives of vector fields along 
curves. The same method can be applied to global vector fields. This leads 
to the following definition. 


Definition 5.2.3 (Covariant derivative). Let M Cc R” be an m-dimen- 
sional submanifold and X be a vector field on M. Fix a point p € M and 
a tangent vector v € T,M. The covariant derivative of X at p in the 
direction v is the tangent vector 


WX (p) := I(p)dX(p)v € T,M, 
where II(p) € R"*” denotes the orthogonal projection onto T,M. 


Remark 5.2.4. If y: J — M is a smooth curve on an interval J C R and 
X € Vect(M) is a smooth vector field on M, then X 07 is a smooth vector 
field along y. The covariant derivative of X oy is related to the covariant 
derivative of X by the formula 


V(X 07)(t) = Vy X(v(0))- (5.2.4) 


Remark 5.2.5 (Gau8—-Weingarten formula). Differentiating the equa- 
tion X = ILX (understood as a function from M to R”) and using the 
notation O,X(p) := dX(p)v for the derivative of X at p in the direction v 
we obtain the Gau8—Weingarten formula for global vector fields: 


O,X (p) = VoX (p) + hp(v)X(p). (5.2.5) 
Remark 5.2.6 (Levi-Civita connection). Differentiating a vector field 
Y on M in the direction of another vector field X we obtain a vector field 
VxY € Vect(M) defined by 

(VxY)(p) = Vx(py¥ (P) 
for p € M. This gives rise to a family of linear operators 
Vx : Vect(M) > Vect(M), 

one for every vector field X € Vect(M), and the assignment 


Vect(M) — L(Vect(M), Vect(M)) : X + Vx 
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is itself a linear operator. This operator is called the Levi-Civita connec- 
tion on the tangent bundle 7'M. It satisfies the conditions 


Vex(Y) = fVXY, (5.2.6) 

Vx (fY) = fVxY + (Lxf)Y, (5.2.7) 
LX{Y, Z) = (VxY, Z) H(Y, VxZ), (5.2.8) 
WX —VxY = [X,Y] (5.2.9) 


for all X,Y,Z € Vect(M) and f € F(M), where Lx f = dfoX and 
[X,Y] € Vect(M) denotes the Lie bracket of the vector fields X and Y. 
The next lemma asserts that the Levi-Civita connection is uniquely deter- 
mined by (5.2.8) and (5.2.9). 


Lemma 5.2.7 (Uniqueness Lemma). There is a unique linear operator 
Vect(M) + L(Vect(M), Vect(M)) : X 4 Vx 
satisfying equations (5.2.8) and (5.2.9) for all X,Y, Z © Vect(M). 


Proof. Existence follows from the properties of the Levi-Civita connection. 
We prove uniqueness. Let X ++ Dx be any linear operator from Vect(M) 
o L(Vect(M), Vect(M)) that satisfies (5.2.8) and (5.2.9). Then we have 


LxiYy4)= (DxY, Z)+ (Y, DxZ), 
Ly (X,Z) = (DyX, Z)+ (X, Dy Z), 
SANS 07 Yi er. 


Adding these three equations we find 
Lay, Z) a Ly (Z, X) _— L£z(X,Y) 
= 2(DxY, Z) + (Dy X — DxY, Z) 


+(X, DyZ — DzY) +(Y, DxZ — DzX) 
= 2(DxY, Z) + ([X,Y], Z) + (X,[Z, Y]) + (Y, [Z, X]). 


The same equation holds for the Levi-Civita connection and hence 


(DxY, Z) = (Vx, Z). 


This implies Dx Y = VxY for all X,Y € Vect(M). 
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Remark 5.2.8 (The Levi-Civita connection in local coordinates). 
Let 6: U > Q be a coordinate chart on an open set U C M with values in 
an open set 2. C R™. In such a coordinate chart a vector field X € Vect(/) 
is represented by a smooth map 


CS aye OS 
defined by 


£(0(p)) = do(p) X (p) 


for p € U. If Y € Vect(M) is represented by 7, then VxY is represented by 
the map 


POs a or 
(Ven)* = Ss aut + SS Te ein! (5.2.10) 
i=1 ij=l 


Here the rk :Q — R are the Christoffel symbols defined by 


pel (O90 , O90; Ogi; 
Te = ke 2 a ee J 2. 
uv 29 2 (5 Ort = Ax* }? eeey) 


where gj; is the metric tensor and g’ is the inverse matrix so that 


> 919" = 8 
j 


(see Lemma 3.6.5). This formula can be used to prove the existence state- 
ment in Lemma 5.2.7 and hence define the Levi-Civita connection in the 
intrinsic setting. 


Exercise 5.2.9. In the proof of Lemma 5.2.7 we did not actually use 
that the operator Dx : Vect(M) — Vect(M) is linear nor that the oper- 
ator X +» Dx is linear. Prove directly that if a map 


Dx : £(M) > £(M) 


satisfies (5.2.8) for all Y, Z € Vect(/), then Dx is linear. Prove that every 
map 


Vect(M) — L(Vect(M), Vect(M)) : X 4 Dx 


that satisfies (5.2.9) is linear. 
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5.2.3. A Global Formula 
Lemma 5.2.10. For X,Y,Z € Vect(M) we have 


R(X, Y)Z = VWweWZ- WZ 4+ VixyZ- (5.2.12) 


Proof. Fix a point p € M. Then the right hand side of equation (5.2.12) at 
p remains unchanged if we multiply each of the vector fields X,Y,Z by a 
smooth function f : M — [0,1] that is equal to one near p. Choosing f with 
compact support we may therefore assume that the vector fields X and Y 
are complete. Let ¢° denote the flow of X and u' the flow of Y. Define the 
map y: R? > M by 


7(s, t) = ¢ ov'(p), s,teER. 


Then 
Oy=X(y), = hy = (GY) (0). 


Hence, by Remark 5.2.5, we have 
Vs(Z 07) =(VxZ) (7),  V(Zo7) = (VesyZ) (1). 
This implies 
VsVi(Z 07) = (Vo,7Vesy Z) (1) + (Vo,esyZ) (7): 
Since 
cs OY = |x, ¥| 
OGG 
and Osy = X(7) we obtain 


VsVi(Z 0 )(0,0) = Vx WZ(p) + VixyjZ(p), 
VVs(Z 0 y)(0,0) = WVxZ(p). 


(VeVi(Z oy) — VWiVe(Z 0)) (0, 0) 
Vx W Z(p) — WVxZ(p) + Vix,y)Z(p)- 


= 
bai 
= 
a 
= 
aN 
S 
| 


This proves Lemma, 5.2.10. 
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Remark 5.2.11. Equation (5.2.12) can be written succinctly as 


[Vx, W] + Vixy] = R(X,Y). (5.2.13) 
This can be contrasted with the equation 
[Lx, Ly] + Lyx) =0 (5.2.14) 
for the operator £x on the space of real valued functions on M. 


Remark 5.2.12. Equation (5.2.12) can be used to define the Riemann 
curvature tensor. To do this one must again prove that the right hand side 
of equation (5.2.12) at p depends only on the values X(p), Y(p), Z(p) of the 


vector fields X,Y, Z at the point p. For this it suffices to prove that the map 
Vect(M) x Vect(M) x Vect(M) > Vect(M) : (X,Y, Z) WH R(X, Y)Z 
is linear over the Ring *(M) of smooth real valued functions on M, i.e. 
RX, Y)2Z = RA TY)Z = REY FZ = fRAY)Z (5.2.15) 


for X,Y,Z € Vect(M) and f € F(M). The formula (5.2.15) follows from 
the equations (5.2.6), (5.2.7), (5.2.14), and |X, fY] = f[X,Y] -— (Lx f)Y. It 


follows from (5.2.15) that the right hand side of (5.2.12) at p depends only 
on the vectors X(p), Y(p), Z(p). The proof requires two steps. One first 
shows that if X vanishes near p, then the right hand side of (5.2.12) vanishes 
at p (and similarly for Y and Z). Just multiply X by a smooth function 
equal to zero at p and equal to one on the support of X; then fX = X and 
hence the vector field R(X,Y)Z = R(fX,Y)Z = fR(X,Y)Z vanishes at p. 
Second, we choose a local frame F),..., Em € Vect(M), i.e. vector fields 
that form a basis of T,.M for each p in some open set U C M. Then we may 


write - 5 “ 
X=y\leR, Y= Pey 2=> CR 
i=l j=l k=1 


in U. Using the first step and the #(M)-multilinearity we obtain 


R(X,Y)Z= So ei CRR(E,, Ej) Ex 
ij,k=1 


in U. If X'(p) = X(p), then '(p) = €"(p) so if X(p) = X'(p), Y(p) = Y'(p), 
Z(p) = Z'(p), then (R(X, Y)Z)(p) = (R(X', Y’)Z’)(p) as required. 
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5.2.4 Symmetries 


Theorem 5.2.13. The Riemann curvature tensor satisfies 


R(Y,X) =—R(X,Y) = R(X,Y)*, (5.2.16) 
R(X,Y)Z + R(Y,Z)X + R(Z, X)Y =0, (5.2.17) 
(R(X, Y)Z,W) = (R(Z,W)X,Y) (5.2.18) 


for X,Y,Z,W © Vect(M). Equation (5.2.17) is the first Bianchi identity. 


Proof. The first equation in (5.2.16) is obvious from the definition and the 
second follows from the Gau8—Codazzi formula (5.2.3). Alternatively, choose 
a smooth map 7: R*? + M and two vector fields Z,W along y. Then 
0 = 0,0:(Z2,W) — 0:0.(Z, W) 
Os(ViZ, W) + 05(Z, WW) — O(VsZ,W) — O.(Z, VsW) 
(R(Osy, #Y)Z,W) — (Z, R(Osy, 47) W). 


This proof has the advantage that it carries over to the intrinsic setting. We 
prove the first Bianchi identity using (5.2.9) and (5.2.12): 


R(X, Y)Z+ R(Y, Z)X + R(Z,X)Y 

= V«WZ—-WVxZ + Vixy)Z + WVeX — V2 WX + Vy.2)X 
+VzVxY — VxVzY + Viz.xj¥ 

= Vy.g)X — VxlY, Z] + Vig ¥ — WIZ, X] + VixyyZ — VelX, ¥] 

= [X, [Y, Z]] + [Y, [Z, X] + [Z, [X,Y]. 


The last term vanishes by the Jacobi identity. We prove (5.2.18) by com- 
bining the first Bianchi identity with (5.2.16): 


(R(X, Y)Z,W) — (R(Z,W)X,Y) 

= —(R(Y, Z)X,W) — (R(Z, X)Y,W) — (R(Z,W)X,Y) 

= (R(Y, Z)W, X) + (R(Z, X)W,Y) + (R(W, Z)X,Y) 

= (R(Y, Z)W, X) — (R(X,W)Z,Y) 

= (R(Y, Z)W, X) — (RW, X)Y, 2). 
Note that the first line is related to the last by a cyclic permutation. Re- 
peating this argument we find 


(R(Y, Z)W, X) — (R(W, X)Y, Z) = (R(Z,W)X,Y) — (R(X, Y)Z,W). 


Combining the last two identities we obtain (5.2.18). This proves Theo- 
rem 5.2.13. 
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Remark 5.2.14. We may think of a vector field X on M as a section of 
the tangent bundle. This is reflected in the alternative notation 


0°(M,TM) := Vect(M). 


A 1-form on M with values in the tangent bundle is a collection of 
linear maps A(p) : T,M — T,M, one for every p € M, which is smooth 
in the sense that for every smooth vector field X on M the assignment 
p++ A(p)X(p) defines again a smooth vector field on M. We denote by 


01(M,TM) 


the space of smooth 1-forms on M with values in 7M. The covariant deriva- 
tive of a vector field Y is such a 1-form with values in the tangent bundle 
which assigns to every p € M the linear map T,M > T,M :v +> WY (p). 
Thus we can think of the covariant derivative as a linear operator 


V:0°(M,TM) > 01(M,TM). 


The equation (5.2.6) asserts that the operators X ++ Vx indeed determine 
a linear operator from 0°(M,TM) to Q'(M,TM). Equation (5.2.7) as- 
serts that this linear operator V is a connection on the tangent bundle 
of M. Equation (5.2.8) asserts that V is a Riemannian connection and 
equation (5.2.9) asserts that V is torsion-free. Thus Lemma 5.2.7 can 
be restated as asserting that the Levi-Civita connection is the unique 
torsion-free Riemannian connection on the tangent bundle. 


Exercise 5.2.15. Extend the notion of a connection to a general vector bun- 
dle E, both as a collection of linear operators Vx : 2°(M, E) > 0°(M, E), 
one for every vector field X € Vect(/), and as a linear operator 


V :0°(M, E) > 01(M, E) 


satisfying the analogue of equation (5.2.7). Interpret this equation as a Leib- 
niz rule for the product of a function on M with a section of E. Show that 
V~ is a connection on TM+. Extend the notion of curvature to connections 
on general vector bundles. 


Exercise 5.2.16. Show that the field which assigns to each p € M the 
multi-linear map ies :TpM x T,M — L(T,M+,T,M*+) characterized by 


R+(0,7,y)Y = VEViY — VEVLY 
for y: R? > M and Y € Vect+(y) satisfies the equation 
RE (u,v) = hp(t)lip(v)* — hp(v)Rp(u)* 
forp € M and u,v € 7,M. 
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5.2.5 Riemannian Metrics on Lie Groups 


We begin with a calculation of the Riemann curvature tensor on a Lie sub- 
group of the orthogonal group O(n) with the Riemannian metric inherited 
from the standard inner product 


(v, w) := trace(v'w) (5.2.19) 


on the ambient space gl(n,R) = R"*". This fits into the extrinsic setting 
used throughout most of this book. Note that every Lie subgroup of O(n) 
is a closed subset of O(n) by Theorem 2.5.26 and hence is compact. 


Example 5.2.17. Let G C O(n) be a Lie subgroup and let 
g := Lie(G) = 7T)G 


be the Lie algebra of G. Consider the Riemannian metric on G induced by 
the inner product (5.2.19) on R"~". Then the Riemann curvature tensor 


on G can be expressed in terms of the Lie bracket (see item (d) below). 
1 


(a) The maps g +> ag, g +> ga, g++ g_~ are isometries of G for every a € G. 
(b) Let y: R > G be a smooth curve and X € Vect(y) be a smooth vector 


field along 7. Then the covariant derivative of X is given by 


AVX = FOX +5 HOMO. AH XO]. 6.2.20) 


(Exercise: Prove equation (5.2.20). Hint: Since g C o(n) we have the 
identity trace((€n + n€)¢) = 0 for all €,7,¢ € g.) 

(c) A smooth map 7: R > G is a geodesic if and only if there exist matrices 
g € Gand € € g such that 


y(t) = gexp(té). (5.2.21) 


For G = O(n) we have seen this in Example 4.3.12 and in the general 
case this follows from equation (5.2.20) with X = +. Hence the exponential 
map exp : g > G defined by the exponential matrix (as in §2.5) agrees with 
the time-1-map of the geodesic flow (as in §4.3). —_ 


(d) The Riemann curvature tensor on G is given by 
= Woe es 2 = 
g'Ro(u,v)w = —Fllg'ug Vv), gw] (5.2.22) 


forg € Gand u,v,w € TG. Note that the first Bianchi identity is equivalent 
to the Jacobi identity. (Exercise: Prove equation (5.2.22).) 
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Definition 5.2.18 (Bi-invariant Riemannian metric). Let G be a Lie 
subgroup of GL(n,R) and let g = Lie(G) = T4G be its Lie algebra. A Rie- 
mannian metric on G is called bi-invariant iff it has the form 

(v, w)g = (vg™*, wg") (5.2.23) 


forg € Gandv,w € T,G, where (-,-) is an inner product on the Lie algebra g 
that is invariant under conjugation, i.e. it satisfies the equation 


(é,n) = (9&9 *, 9ng"'). (5.2.24) 
for all €,n € g and allg EG. 
Exercise 5.2.19. Prove that the Riemannian metric induced by (5.2.19) 
on any Lie subgroup G C O(n) is bi-invariant. 
Exercise 5.2.20. Use the Haar measure ([66, Chapter 8]) to prove that 
every compact Lie group admits a bi-invariant Riemannian metric. 
Exercise 5.2.21. Prove that all the assertions in Example 5.2.17 carry over 
verbatim to any Lie group equipped with a bi-invariant Riemann metric. 


Exercise 5.2.22 (Invariant inner product). Prove that, if an inner prod- 
uct on the Lie algebra g of a Lie group G is invariant under conjugation, 
then it satisfies the equation 


([é,7],¢) = (6, [n, ¢]) (5.2.25) 


for all €,n,¢ € g. If G is connected, prove that, conversely, equation (5.2.25) 
implies (5.2.24). An inner product on an arbitrary Lie algebra g is called 
invariant iff it satisfies equation (5.2.25). 


Exercise 5.2.23 (Commutant). Let g be a finite-dimensional Lie algebra. 
The linear subspace spanned by all vectors of the form [€,7] is called the 
commutant of g and is denoted by [g,g] := span {[€,7]|€,7 € g}. If g 
is equipped with an invariant inner product, prove that [g,g]+ = Z(g) is 
the center of g (Exercise 2.5.33) and hence g = [g,g] ® Z(g). Prove that the 
Heisenberg algebra h in Exercise 2.5.15 satisfies [h, h] = Z(h) and hence does 
not admit an invariant inner product. 


Example 5.2.24 (Killing form). Every finite-dimensional Lie algebra g 
admits a natural symmetric bilinear form « : g x g > R that satisfies equa- 
tion (5.2.25). It is called the Killing form and is defined by 


K(E, 7) := trace(ad(€)ad(m)), £,n €g, (5.2.26) 


where ad : g — Der(g) is the adjoint representation in Example 2.5.22. The 
Killing form may have a kernel (which always contains the center of g), and 
even if it is nondegenerate, it may be indefinite. 
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Exercise 5.2.25. Prove that «([€,7],¢) = «(€,[7,¢]) for all €,7,¢ € g. 


Exercise 5.2.26. Assume that g admits an invariant inner product. For 
each € € g prove that the derivation ad(&) is skew-adjoint with respect to this 
inner product and deduce that «(€,£) = —trace(ad(€)*ad(€)) = —|ad(€)|?. 
Deduce that the Killing form is nondegenerate whenever g has a trivial 
center and admits an invariant inner product. 


Example 5.2.27 (Right invariant Riemannian metric). Let G be 
any Lie subgroup of GL(n,R) (not necessarily contained in O(n)), and 
let g := Lie(G) = 7G be its Lie algebra. Fix any inner product on the 
Lie algebra g (not necessarily invariant under conjugation) and consider the 
Riemannian metric on G defined by 


(v, w)g = (vg7*, wg) (5.2.27) 


for v,w € T,G. This metric is called right invariant. 


(a) The map g +> ga is an isometry of G for every a € G. 
(b) Define the linear map A : g > End(g) by 


(A(En, 6) = 5 (Eb. ) — (ns) — Gnd) (5.2.28) 
for €,7,¢ € g. Then A is the unique linear map that satisfies 


A(Q) + AE)" = 0, — AlmE — A(E)n = [én 


for all £,7 € g, where A(€)* is the adjoint operator with respect to the inner 
product on g. Let y : R > G be a smooth curve and X € Vect(y) be a 
smooth vector field along y. Then the covariant derivative of X is given by 


Ves (Ga) + AG )xy7} 7. (5.2.29) 


(Exercise: Prove this. Moreover, if the inner produt on g is invariant, prove 
that A(E)n = —5[€,] for all €,7 € g.) 


(c) A smooth curve 7: R > G is a geodesic if and only if it satisfies 
(eres en ines see 

= (Vy) + AG")? = 0. (5.2.30) 
dt 

(Exercise: G is complete.) 


(d) The Riemann curvature tensor on G is given by 


(Ro(E9,n9)69)9-! = (A (nl) + A © AMI) S (5.2.31) 
for g € G and £,7,¢ € g. (Exercise: Prove this.) 
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5.3 Generalized Theorema Egregium 


We will now show that geodesics, covariant differentiation, parallel trans- 
port, and the Riemann curvature tensor are all intrinsic, i.e. they are in- 
tertwined by isometries. In the extrinsic setting these results are somewhat 
surprising since these objects are all defined using the second fundamental 
form, whereas isometries need not preserve the second fundamental form in 
any sense but only the first fundamental form. 

Below we shall give a formula expressing the Gaufian curvature of a 
surface M? in R® in terms of the Riemann curvature tensor and the first 
fundamental form. It follows that the Gaufian curvature is also intrinsic. 
This fact was called by Gauf the “Theorema Egregium” which explains the 
title of this section. 


5.3.1 Pushforward 


We assume throughout this section that M CR” and M’ Cc R™ are smooth 
submanifolds of the same dimension m. As in §5.1 we denote objects on M’ 
by the same letters as objects in M with primes affixed. In particular, g’ 
denotes the first fundamental form on M’ and R’ denotes the Riemann 
curvature tensor on M’. 

Let 6: M — M’' be a diffeomorphism. Using @ we can move objects 
on M to M’. For example the pushforward of a smooth curve y: I > M is 
the curve 

dey = Poy: TOM’, 


the pushforward of a smooth function f : M — R is the function 


buf = fod !:M' OR, 


the pushforward of a vector field X € Vect(y) along a curve y: I > M is 
the vector field ¢,X € Vect(¢.7) defined by 


(PX )(t) = db(y(t)) X(t) 


for t € I, and the pushforward of a global vector field X € Vect(M) is the 
vector field ¢,X € Vect(M’) defined by 


(dX )(O(p)) := do(p) X (p) 


for p € M. Recall that the first fundamental form on M is the Riemannian 
metric g defined as the restriction of the Euclidean inner product on the 
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ambient space to each tangent space of M. It assigns to each p € M the 
bilinear map gp € T,M x T,M — R given by 


Gp(u,v) = (u,v), u,v € TpM. 


Its pushforward is the Riemannian metric which assigns to each p’ € M’ the 
inner product ($49) »' : TM’ x T,yM’' > R defined by 


($49) o(p) (do(p)u, do(p)v) := gp (u,v) 


for p := @ +(p') € M and u,v € T,M. The pushforward of the Riemann 
curvature tensor is the tensor which assigns to each p’ € M’ the bilinear 
map (¢4R)p : TM! x Ty M! — £L (Ty M', Ty M") , defined by 


($2) 6(p) (db(p)u, do(p)v) := db(p)Rp (u,v) db(p)~* 


for p:= d'(p') € M and u,v € T,M. 


5.3.2 Theorema Egregium 


Theorem 5.3.1 (Theorema Egregium). The first fundamental form, co- 
variant differentiation, geodesics, parallel transport, and the Riemann cur- 
vature tensor are intrinsic. This means that for every isometry ¢: M > M’ 
the following holds. 


(i) dg = 9’. 
(ii) If X © Vect(y) ts a vector field along a smooth curve y: I + M, then 


V' (b4X) = ba VX, (5:31) 
and if X,Y € Vect(M) are global vector fields, then 


bax OxY = bs(VXY). (53.2) 


(iii) If y: I + M is a geodesic, then doy: I — M' is a geodesic. 
(iv) Ify:I— M is a smooth curve, then for all s,t € I, we have 


goy(t 8)4b(7(s)) = do(y(t)) ®y(t, 8). (5.3.3) 


(v) & R= R’. 


246 CHAPTER 5. CURVATURE 


Proof. Assertion (i) is simply a restatement of Theorem 5.1.1. To prove (ii) 
we choose a local smooth parametrization ~ : Q — U of an open set U c M, 
defined on an open set 2 C R™, so that ~~! : U > Q is a coordinate chart. 
Suppose without loss of generality that y(t) € U for all t € J and define 
c:Il>Qand €:I > R"” by 


nt) =e), X= Lee) 
i=1 


Recall from equations (3.6.6) and (3.6.7) that 
= S° E*( +¥ork rk. (t)€? (t) ak (elt), 
ie 
k=] ij=l 

where the Christoffel symbols ry :Q — R are defined by 

a “0 ne OY 

Ox'OxI S @qgk 
Now consider the same formula for ¢,X using the parametrization 

WW :=¢00:N23U' :=¢(U) CM’. 


The Christoffel symbols I”; . : Q > R associated to this parametrization of 
U’ are defined by the same ‘fonnuls as the ry with y replaced by w’. But 
the metric tensor for y agrees with the metric tensor for ’: 


: (3 oi) = (3 a 
Fi \ Oat? Ox Ox*’ OxI [- 


Hence it follows from Lemma 3.6.5 that I; k =Ti, for all 2,7, &. This implies 
that the covariant derivative of 6,X is snch by 


uae ee at .\ Ay’ 
(+ Ko ride | Be 


k=1 ij=l 


= aso) > (& + Vo rhode | F200 


k=1 ij=l 


V' (bX) 


= VX. 


This proves (5.3.1). Equation (5.3.2) follows immediately from (5.3.1) and 


Remark 5.2.4. 
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Here is a second proof of (ii). For every vector field X € Vect(M) we 
define the operator Dx : Vect(M) — Vect(M) by 


DxY := * (Va.xde¥). 
Then, for all X,Y € Vect(M/), we have 

Dy X — DxY = o* (We. vOeX — Ve.xG«Y) = b"|b4X, OxY] = [X,Y]. 
Moreover, it follows from (i) that 


PLx{Y,Z) = L4.x(b«Y, bxZ) 

(Vb.XP«Y, brZ) + (Ga Y, Vix GZ) 
= (bsDxY,sZ) + (O«Y, bsDx Z) 

= ¢4((DxY,Z) + (Y,DxZ)). 


and hence £Lx{Y,Z) = (DxY,Z) + (Y,DxZ) for all X,Y,Z © Vect(M). 
Thus the operator X ++ Dx satisfies equations (5.2.8) and (5.2.9) and, by 
Lemma 5.2.7, it follows that DxY = VxY for all X,Y © Vect(M). This 
completes the second proof of (ii). 

We prove (iii). Since ¢@ preserves the first fundamental form it also 
preserves the energy of curves, namely 


E(¢o 7) = Ey) 


for every smooth map ¥ : [0,1] + M. Hence 7 is a critical point of the energy 
functional if and only if 60 y is a critical point of the energy functional. 
Alternatively it follows from (ii) that 


,(d a . 
Vv (007) = V'bsy = Ou VY 


for every smooth curve y: I > M. If 7 is a geodesic, the last term vanishes 

and hence ¢07 is a geodesic as well. As a third proof we can deduce (iii) from 

the formula ¢(exp,(v)) = expgp)(d¢(p)v) in the proof of Theorem 5.1.1. 
We prove (iv). For to € I and vp € T,q4,)M define ——— 


X(t) = ®,(t, to)vo, X"(t) := Boq(t, to)do((to))v0- 


By (ii) the vector fields X’ and ¢,X along ¢0 7 are both parallel and they 
agree at t = to. Hence X'(t) = $,X(t) for all t € J and this proves (5.3.3). 
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We prove (v). Fix a smooth map y : R? + M and a smooth vector 
field Z along y, and define 
7 =o07:R2? 5M’, Z' := bsZ € Vect(y’). 
Then it follows from (ii) that 
R'(Osy,,Y)Z' = VINUZ' -ViVEZ' 
= ox (VsViZ — ViVsZ) 


do(y)R(Osy;, #Y)Z 
= (b4R) (O07, 07) 2". 


This proves (v) and Theorem 5.3.1. 


The assertions of Theorem 5.3.1 carry over in slightly modified form to 
local isometries ¢ : M — M’. In particular, the pushforward of a vector field 
on M under ¢ is only defined when ¢ is a diffeomorphism while the pushfor- 
ward of a vector field along a curve is defined for any smooth map ¢. Also, 
the pushforward of the Riemann curvature tensor under a local isometry is 
only defined locally, and local isometries satisfy the local analogue of the 
first assertion in Theorema Egregium by definition. 


Corollary 5.3.2 (Theorema Egregium for Local Isometries). Every 
local isometry 6: M — M' has the following properties. 


(i) Every vector field X along a smooth curve y: I + M satisfies (5.3.1). 
(ii) Ify:I — M is a geodesic, then so is poy: I > M'. 
(iii) Parallel transport along a smooth curve y: I — M satisfies (5.3.3). 


(iv) The curvature tensors R of M and R' of M' are related by the formula 


‘s(p) (4b(p)u, db(p)v) = db(p)Rp(u, v)do(p)* (5.3.4) 
for allp€ M and allu,v € T,M. 


Proof. Let po € M. Then, by the Inverse Function Theorem 2.2.17, there 
exists an open neighborhood U C M of po such that U’ := @(U) is an open 
subset of M’ and the restriction ¢|y :U — U’ is a diffeomorphism. This 
restriction is an isometry by Theorem 5.1.1. Hence, by Theorem 5.3.1 the 
assertions (i) and (ii) hold for the restriction of y to Ip := y~!(U) (a union 
of subintervals of J) and (iv) holds for all p € U. Since these are local state- 
ments and po was chosen arbitrary, this proves (i), (ii), and (iv). Part (iii) 
follows directly from (i) as in the proof of of Theorem 5.3.1 and this proves 
Corollary 5.3.2. aa 
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The next corollary spells out a useful consequence of Corollary 5.3.2. For 
sufficiently small tangent vectors equation (5.3.5) below already appeared 
in the proof of Theorem 5.1.1 and was used in Lemma 5.1.10 and Exer- 
cise 5.1.11. When ™M is not complete, recall the notation V, C T,M for 
the domain of the exponential map of M at a point p (Definition 4.3.5). 
For p’ € M’ denote the domain of the exponential map by V,, C Ty MM’. 


Corollary 5.3.3. Let ¢: M — M' be a local isometry and let p € M. 
Then dd(p)Vp C Vis@) and, for every v € Vp, 
$(exp,(v)) = exp) p) (do(p)v). (5.3.5) 


Proof. Let v € Vp C T,M and define y(t) := exp,(tv) for 0 < t < 1. 
Then y: [0,1] > M is a geodesic by Lemma 4.3.6, and hence so is the 
curve 7/ := 607: [0,1] > M’ by Corollary 5.3.2. Moreover, 


1'(0) = 6(7(0)) = o(p), —- ¥(0) = de(7(0))4(0) = dg(p)v 


by the chain rule. Hence it follows from the definition of the exponential 
map (Definition 4.3.5) that d@(p)uv € Vobs and 


explyy,)(d6(p)v) = 9/(1) = (41) = 6(expp(v)). 
This proves Corollary 5.3.3. 


5.3.3 GaufBian Curvature 


As a special case we shall now consider a hypersurface M Cc R™*}, ie. 
a smooth submanifold of codimension one. We assume that there exists a 
smooth map v : M — R™*! such that, for every p € M, we have 


v(p) L TpM, |v(p)| = 1. 


Such a map always exists locally (see Example 3.1.3). Note that v(p) is 
an element of the unit sphere in R™*+! for every p © M and hence we can 
regard v as a map from M to $™: v: M — S$”. Such a map is called a 
Gau8B map for M. Note that if vy: M — S? is a Gau8 map, so is —v, but 
this is the only ambiguity when M is connected. Differentiating v at p © M 


we obtain a linear map 
dyv(p) : T,M > T,p)S™ = T,M 
Here we use the fact that T,(,)S" = v(p)+ and, by definition of the Gau8 


map v, the tangent space of M at p is also equal to v(p)+. Thus dv(p) is a 
linear map from the tangent space of M at p to itself. 
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Definition 5.3.4. The Gau®ian curvature of the hypersurface M is the 
real valued function K : M — R defined by 


K (p) := det(dv(p) : T,M > T,M) 


forpeM. (Replacing v by —v has the effect of replacing K by (—1)"K; 
so K is independent of the choice of the Gauf map when m is even.) 


Remark 5.3.5. Given a subset B C M the set v(B) C S”™ is often called 
the spherical image of B. If v is a diffeomorphism on a neighborhood of 
B, the change of variables formula for an integral gives 


/ ie | Klien. 
v(B) B 


Here py and yg denote the volume elements on M and S$”, respectively. 
Introducing the notation Areay;(B) := [2 wu we obtain the formula 


c= po, See 


This says that the curvature at p is roughly the ratio of the (m-dimensional) 
area of the spherical image v(B) to the area of B where B is a very small open 
neighborhood of p in M. The sign of K(p) is positive when the linear map 
dv(p) : T,M — T,M preserves orientation and negative when it reverses 
orientation. 


Remark 5.3.6. We see that the Gaufian curvature is a natural general- 
ization of Euler’s curvature for a plane curve. Indeed if M C R? is a 
1-manifold and p € M, we can choose a curve y = (x,y) : (—é,¢€) > M such 
that y(0) = p and |4+(s)| = 1 for every s. This curve parametrizes M by the 
arclength and the unit normal vector pointing to the right with respect to 
the orientation of 7 is v(x, y) = (y, -£). This is a local Gau8 map and its 
derivative (¥, —%) is tangent to the curve. The inner product of the latter 
with the unit tangent vector 7 = (4, y) is the Gaufian curvature. Thus 


dx dy dyd?x _ dé 
“ dsds? dsds2 ds 


where s is the arclength parameter and @ is the angle made by the normal 
(or the tangent) with some constant line. With this convention K is positive 
at a left turn and negative at a right turn. 
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Exercise 5.3.7. The Gaufian curvature of a sphere of radius r is constant 


and has the value r~™. 


Exercise 5.3.8. Show that the Gaufian curvature of the surface z = x?—y? 


is —4 at the origin. 


Ge a an 


K>0 K=0 K <0 


Figure 5.1: Positive and negative Gaufian curvature. 


We now restrict to the case of surfaces, i.e. of 2-dimensional submani- 
folds of R°. Figure 5.1 illustrates the difference between positive and nega- 


tive Gaufian curvature in dimension two. 


Theorem 5.3.9 (GauBian curvature). Let M C R® be a surface and fir 


a pointpe M. Ifu,v € T,M is a basis, then 


Riu, v)v, u 
Kp) = ene uae 


Moreover, 
R(u, v)w = —K(p)(v(p), u x v)v(p) x w 


for allu,v,w € TM. 
Proof. The orthogonal projection of R® onto the tangent space 
TM = v(p)+ 
is given by the 3 x 3-matrix 
II(p) = 1 —v(p)v(p)". 


Hence 
dIl(p)u = —v(p)(du(p)u)" — (du(p)u)v(p)°. 


(5.3.6) 


(5.3.7) 


Here the first summand is the second fundamental form, which maps 7, 
to T,M + and the second summand is its dual, which maps TM + to TyM. 


Thus 
hy(v) = v(p) (dv(p)v)" :T)M + T,M-, 
hp(u)* (dv(p)u)v(p)" :Tp>M+ + TyM. 
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By the Gauf—Codazzi formula this implies 


Rp(u,v)w = hp(u)*hp(v)w — hal ee 
= (dv(p)u) (dv(p)v) "w — (dv(p)v) yw) (dv(p)u) 'w 
= (dv(p)v, w)du(p)u — (du(p : w)dv(p)v 
and hence 


(Rp(u, v)w, Zz) = (dv(p)u, z)(dv(p)u, w) — (dv(p)u, w)(dv(p)v, z). (5.3.8) 


Now fix four tangent vectors u,v, w,z € T,M and consider the composition 


R? 4, R3 4, Rp? 2, R3 


of the linear maps 


AE = €'v(p) + ut &u, 
_ fs dv(p)n, ifn + v(p), 
a. { n, if 7 € Rv(p), 


(¢,u(p)) 
CO := (¢, 2) : 
(¢, w) 


This composition is represented by the matrix 
1 0 0 
CBA=| 0 (dv(p)u,z) (dv(p)v,z) |. 
0 


Hence, by (5.3.8), we have 


(R,(u,v)w,z) = det(CBA) 

= det(A) det(B) det(C) 
(v(p),u x v)K (p)(v(p), 2 x w) 
= —K(p)(v(p),u x v)(v(p) x w, 2). 


This implies (5.3.7) and 


(Rp(u,v)v,u) = K(p)(v(p),u x v)? 
= K(p)|ux vo)? 


K(p) (Iul? lol? = (u,0)?) . 


This proves Theorem 5.3.9. 
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Remark 5.3.10. Equation (5.3.6) implies 


(Rp(u,v)w, 2) = K(p)((u, 2)(v,w) = (u,w)(v,z)) (6.3.9) 


for all p€ M and all u,v,w,z €T,M. This is proved in Theorem 6.4.8 
below. Exercise: Deduce this formula from (5.3.7). 


Corollary 5.3.11 (Theorema Egregium of Gauf). The Gaufian cur- 
vature is intrinsic, i.e. if 
¢:M— > M! 


is an isometry of surfaces in R°, then 


K=K'od: MAR. 


Proof. Theorem 5.3.1 and Theorem 5.3.9. 


Exercise 5.3.12. For m = 1 the Gaufian curvature is clearly not intrinsic 
as any two curves are locally isometric (parameterized by arclength). Show 
that the curvature K (p) is intrinsic for even m while its absolute value | K (p)| 
is intrinsic for odd m > 3. Hint: We still have the equation (5.3.8) which, 
for z = u and v = w, can be written in the form a 


(Ry(u,v)v,u) = det ( (due 


Thus, for every orthonormal basis vj,...,Um of T,M, the 2 x 2 minors of 
the matrix 


((dv(p)vi, 05) gaa, am 
are intrinsic. Hence everything reduces to the following assertion. 


Lemma. The determinant of an m xm matrix is an expression in its 2 x 2 
minors if m is even; the absolute value of the determinant is an expression 
in the 2 x 2 minors if m is odd and greater than or equal to 3. 


The lemma is proved by induction on m. For the absolute value, note the 
formula 


det(A)"™ = det(det(A)1,,) = det(AB) = det(A) det(B) 


for an m X m-matrix A where B is the transposed matrix of cofactors. 
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5.4 Curvature in Local Coordinates* 


Riemann 


Let M Cc R* be an m-dimensional manifold and let 6 = ~-! : U = O 
be a local coordinate chart on an open set U C M with values in an open 
set Q Cc R’. Define the vector fields E),..., Ey, along w by 


Ow 
These vector fields form a basis of Ty,)M for every x € Q and the coeffi- 
cients gj; : (— R of the first fundamental form are gj; = (Ej, Ej). Recall 
from Lemma 3.6.5 that the Christoffel re : Q > R are the coefficients of 


the Levi-Civita connection, defined by 


m 
ViEj = DOT ER 
k=1 


and that they are given by the formula 


: = Dols (Digje Oj Gie Oegi;)- 


Define the coefficients Ré 


ijk ° >Q— Rand Ryze : Q > R of the Riemann cur- 
vature tensor by 


R(E,, Ej) Ex = S > Bins, (5.4.1) 
(=1 
Rijre = (R(Ei, Bj) Ex, Ee) = Y Rinne (5.4.2) 


These coefficients are given by 


Rijn = 0; Vi ~ 0; Vn a y (Try jk My te) (5.4.3) 
The coefficients of the Riemann curvature tensor have the symmetries 
Rijne = —Ryike = —Rijer = Reeij (5.4.4) 
and the first Bianchi ee has the form 
Rijn + Roni + Reig = Rijne + Ryxie + Reige = 0. (5.4.5) 


Warning: Care must be taken with the ordering of the indices. Some 
authors use the notation Rhij for what we call Ri , and Rex; for what we 
call Rijre. 
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Exercise 5.4.1. Prove equations (5.4.3), (5.4.4), and (5.4.5). Use (5.4.3) 


to give an alternative proof of Theorem 5.3.1. 


GauB 


If M Cc R” is a 2-manifold (not necessarily embedded in R*), we can use 
equation (5.3.6) as the definition of the Gaufian curvature K : M —> R. 
Let »: QU bea local parametrization of an open set U C M defined on 
an open set  C R?. Denote the coordinates in R? by (a, y) and define the 
functions £, F,G:Q—- R by 


E := |)", F := (Ont, Oy), G = |dyp|*. 


We abbreviate D := EG — F?. Then the composition of the Gaufian curva- 
ture K : M — R with the parametrization w is given by 


j E F OyF — 5024 
Koy = 7a det F G 50 G 
50rE O,F —j,0,E —j02E + 0,0,F — 582G 
i E F 20, 


50,E 50,G 0 
ae (Pen) 


2/D Ox EVD 
1 O (es — FO,E — =) 
' /D Oy EVD 


This expression simplifies dramatically when F' = 0 and we get 


1 (5 0G nm 0 a 
2/EG \Orx JEG OyV/EG 
Exercise 5.4.2. Prove that the Riemannian metric 

4 


E=G=—W .W_ 35 F= 
(ieee) m 


Koyp= (5.4.6) 


on R? has constant constant curvature K = 1 and the Riemannian metric 
E=G= — ~~. F=0, 


on the open unit disc has constant curvature K = —1. 
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Chapter 6 


Geometry and Topology 


In this chapter we address what might be called the “fundamental problem 
of intrinsic differential geometry”: when are two manifolds isometric? The 
central tool for addressing this question is the Cartan—Ambrose—Hicks The- 
orem (§6.1). In the subsequent sections we will use this result to examine 
flat spaces (§6.2), symmetric spaces (6.3), and constant sectional curvature 


manifolds (§6.4). The chapter then examines manifolds of nonpositive sec- 
tional curvature and includes a proof of the Cartan Fixed Point Theorem 
(§6.5). The last three sections introduce the Ricci tensor and show that com- 
plete manifolds with positive Ricci tensor are compact (§6.6) and discuss the 


scalar curvature (§6.7) and the Weyl tensor (§6.8). a 


6.1 The Cartan—Ambrose—Hicks Theorem 


The Cartan—Ambrose—Hicks Theorem answers the question (at least locally) 
when two manifolds are isometric. In general the equivalent conditions given 
there are probably more difficult to verify in most examples than the con- 
dition that there exist an isometry. However, under additional assumptions 
it has many important consequences. The section starts with some basic 
observations about homotopy and simple connectivity. 


6.1.1 Homotopy 


Definition 6.1.1. Let M be a manifold and let I = [a,b] be a compact 
interval. A (smooth) homotopy of maps from I to M is a smooth map 
y:[0,1)xI 4 M. We often write y(t) = y(A,t) for A € [0,1] andt € I 
and call y a (smooth) homotopy between y and 7. We say the 
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homotopy has fixed endpoints if y)(a) = yo(a) and y)(b) = yo(b) for all 
r€ [0,1]. (See Figure 6.1.) 


We remark that a homotopy and a variation are essentially the same 
thing, namely a curve of maps (curves). The difference is pedagogical. We 
used the word “variation” to describe a curve of maps through a given 
map; when we use this word we are going to differentiate the curve to find 
a tangent vector (field) to the given map. The word “homotopy” is used 
to describe a curve joining two maps; it is a global rather than a local 
(infinitesimal) concept. 


Figure 6.1: A homotopy with fixed endpoints. 


Definition 6.1.2. The manifold M is called simply connected iff for 
any two curves Yyo,71 : [a,b] > M with yo(a) = y1(a) and yo(b) = y1(b) there 
exists a homotopy from yo to 7 with endpoints fixed. (The idea is that the 
space Qyq of curves from p to q is connected.) 


Remark 6.1.3. Two smooth maps 7,71 : [a,b] + M with the same end- 
points can be joined by a continuous homotopy if and only if they can be 
joined by a smooth homotopy. This follows from the Weierstrass approxi- 
mation theorem. 


Remark 6.1.4. The topological space Qp,4 of all smooth curves in M with 
the endpoints p and q is connected for some pair of points p,q € M if and 
only if it is connected for every pair of points p,q € M. (Prove this!) 


Example 6.1.5. The Euclidean space R™ is simply connected, for any two 
curves Y,71 : [a,6] > R™ with the same endpoints can be joined by the 


homotopy a(t) := yo(t) + A(1(t) — (4). 
The punctured plane C \ {0} is not simply connected, for the curves 


Yn(t) = e2 mint 0<t<1, 
are not homotopic with fixed endpoints for distinct n. 


Exercise 6.1.6. Prove that the m-sphere S$” is simply connected for m # 1. 
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6.1.2 The Global C-A-H Theorem 


Theorem 6.1.7 (Global C-A-H Theorem). Let MC R" and M' c R” 
be nonempty, connected, simply connected, complete m-manifolds. Fix two 
elements po € M and py € M' and let ®o : Ty, M — T, M' be an orthogonal 
linear isomorphism. Then the following are equivalent. 


(i) There exists an isometry 6: M > M' satisfying 


(Po) = Po; do(po) = ®o. (6.1.1) 
(ii) If (®, 7,7’) is a development satisfying the initial condition 
1(0)=po, Y(O)=p, 80) = 9, (6.1.2) 


then 
7(1) = po => ¥(1)=pp, (1) = Bo 


(iii) If (0,70, Yo) and (®1,71,%4) are developments satisfying (6.1.2), then 


yo(1) = (1) = yo(1) = 71 (1). 


(iv) If (®, 7,7) is a development satisfying (6.1.2), then ®,R, = Ri,. 


Proof. See page 260. 


Figure 6.2: Diagram for Example 6.1.8. 


Example 6.1.8. Before giving the proof let us interpret the conditions in 
case M and M’ are two-dimensional spheres of radius r and r’ respectively in 
three-dimensional Euclidean space R*?. Imagine that the spheres are tangent 
at po = py. Clearly the spheres will be isometric exactly when r = 1’. 
Condition (ii) says that if the spheres are rolled along one another without 
sliding or twisting, then the endpoint y/(1) of one curve of contact depends 
only on the endpoint 7(1) of the other and not on the intervening curve ¥(t). 
By Theorem 5.3.9 the Riemann curvature of a 2-manifold at p is determined 
by the Gaufian curvature K(p); and for spheres we have K(p) = 1/r?. 
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Exercise 6.1.9. Let 7 be the closed curve which bounds an octant as shown 
in the diagram for Example 6.1.8. Find 7’. 


Exercise 6.1.10. Show that in case M is two-dimensional, the condition 
(1) = o in Theorem 6.1.7 may be dropped from (ii). 


Lemma 6.1.11. Let 6: M > M’ be a local isometry and let y: I + M be 
a smooth curve on an interval I. Fix an element tg € I and define 


Po := (to), qo -= (Po), op := do(po). (6.1.3) 


Then there exists a unique development (®,y,7') of M along M' on the 
entire interval I satisfying the initial conditions 


7 (to) = 40; P(to) = Bo. (6.1.4) 


This development is given by 


V(t) = 97), ®(t) = dd(y(@)) (6.1.5) 
fort el. 


Proof. Define 
(= 9), Bt) == de(y(t)) 

for t€ J. Then ¥/(t) = ®(t)¥(t) for all t € I by the chain rule, and ev- 
ery vector field X along y satisfies BVX = V’/(®X) by Corollary 5.3.2. 
Hence (©, 7,7) is a development by Lemma 3.5.19. By (6.1.3) this devel- 
opment satisfies the initial condition (6.1.4). Hence the assertion follows 
from the uniqueness result for developments in Theorem 3.5.21. This proves 
Lemma 6.1.11. 


Proof of Theorem 6.1.7. We first prove a slightly different theorem. Namely, 
we weaken condition (i) to assert that ¢ is a local isometry (i.e. not neces- 
sarily bijective), and prove that this weaker condition is equivalent to (ii), 
(iii), and (iv) whenever M is connected and simply connected and M’ is 
complete. Thus we drop the hypotheses that M be complete and M’ be 
connected and simply connected. 

We prove that (i) implies (ii). Given a development as in (ii) we have, 
by Lemma 6.1.11, 


(1) = $(7(1)) = $(p0) = Po, ®(1) = db(7(1)) = 4d (po) = Oo, 


as required. 
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We prove that (ii) implies (iii) when M’ is complete. Choose develop- 
ments (®;,7:,7;) for 7 = 0,1 as in (iii). Define a curve y : [0,1] ~ M by 
“composition”, i.e. 


yo(2t),  O<t<1/2, 
y= ° 
m(2—2t), 1/2<t<1, 


so that 7 is continuous and piecewise smooth and y(1) = po. By Theo- 
rem 3.5.21 there exists a development (®, 7,7’) on the interval [0,1] satis- 
fying (6.1.2) (because M’ is complete). Since y(1) = po it follows from (ii) 


that 7'(1) = pp and ®(1) = ®o. By the uniqueness of developments and the 
invariance under reparametrization, we have 


($9 (2t), y0(2t), ¥4(2t)), 0<t<1/2, 
UIs) = { (1(2 nee 24), y(2—21)), 1/2<t<1. 


Hence 7(1) = 7/(1/2) = y,(1) as required. 

We prove that (iii) implies (i) when M’ is complete and M is connected. 
Define the map ¢: M — M’ as follows. Fix an element p € M. Since M is 
connected, there exists a smooth curve y: [0,1] + M such that 7(0) = po 
and 7(1) =p. Since M’ is complete, there exists a development (®, 7, y’) 
with (0) = pp and ®(0) = ®o (Theorem 3.5.21). Now define $(p) := 7'(1). 
By (iii) the endpoint p’ := 7/(1) is independent of the choice of the curve 4, 
and so @ is well-defined. We prove that this map ¢ satisfies the following 
(a) If (®, 7,7’) ts a development satisfying y(0) = po, y'(0) = pp, &(0) = Bo, 
then o(y(t)) = Y(t) forO<t<l. 

(b) If p,q €M satisfy 0 < d(p,q) < inj(p;M) and d(p,q) < inj(o(p), M’), 
then d'(d(p), $(q)) = d(p, 9). 

That ¢ satisfies (a) follows directly from the definition and the fact that the 
triple (®:,%,7;,) defined by ®,(s) := ®(st), %(s) := y(st), 7(s) := (st) 
for 0 < s <1 is a development. To prove (b), choose v € T,M such that 


|v] =d(p,q) exp, (v) = 4 
(Theorem 4.4.4) and let 7: [0,1] + M be a smooth curve with 


7(0)=po, y(t) = exp, ((2¢ — 1)v) 


for 5 <t<1. Let (®,7,7’) be the unique development of M along M’ 
satisfying y/(0) = pp and ®(0) = ®p (Theorem 3.5.21). Then, by (a), 


V($)=¢(), (0) =4(4). 
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Also, by part (ii) of Lemma 3.5.19 with X = 4%, the restriction of y to the 
interval [},1] is a geodesic. Thus 7/(t) = expgip)((2t — 1)v") for 5 <t <1, 


where the tangent vector v' € Typ) M’ is given by vu! := 7($) = ®($)v and 
hence satisfies |v’| = |v] = d(p,q) < inj(¢(p), M’). Thus it follows from The- 
orem 4.4.4 that d'(4(p), 6(q)) = d'(d(p), XP yp) (v')) = |v'| = d(p,q) and this 
proves (b). It follows from (b) and Theorem 5.1.1 that ¢ is a local isometry. 


We prove that (i) implies (iv). Given a development as in (ii) we have 


V(t) = oC), O(t) = de(7(t)) 


for every t, by Lemma 6.1.11. Hence it follows from part (iv) of Corol- 
lary 5.3.2 (Theorema Egregium for local isometries) that 


P(t). Ry) = (PR) y(t) = Ry) 


for all t as required. 

We prove that (iv) implies (iii) when M’ is complete and M is simply 
connected. Choose developments (®;,7;,7;) for i = 0,1 as in (iii). Since M 
is simply connected there exists a homotopy 


[0,1] x [0,1] > M: (At) ¥Q, 4) = y(t) 


from yo to 7, with endpoints fixed. By Theorem 3.5.21 there is, for each A, 
a development (®), 7), 7) on the interval [0,1] with initial conditions 


(0) =p,  ®y(0) = Bo 


(because M’ is complete). The proof of Theorem 3.5.21 also shows that 
y(t) and ®)(t) depend smoothly on both t and A. We must prove that 


11 (1) = y0(1). 
To see this we will show that, for each fixed t, the curve 
Ae (®)(t), y(t), 7 (t)) 


is a development; then by the definition of development we have that the 
curve A ++ 7\(1) is smooth and 


Avy (1) = © (L)Ayy(1) = 0 


as required. 


6.1. THE CARTAN-AMBROSE-HICKS THEOREM 263 


First choose a basis €1,...,€m of T;,M and extend it to obtain vector 
fields E; € Vect(y) along the homotopy y by imposing the conditions that 
the vector fields t +> E;(A,t) be parallel, i.e. 

VE (A, t) a 0, E;(A, 0) = &j. (6.1.6) 
Then the vectors £1(A,t),...Em(A,t) form a basis of T,, 4M for all \ and t. 
Second, define the vector fields E; along 7 by 
Ey(A, t) = ®)(t)E;(A, t) (6.1.7) 
so that VE! = 0. Third, define the functions é1,...,€™ : [0,1]? — R by 


Oy Sek, On’ = So EE. (6.1.8) 
i=1 i=1 


Here the second equation follows from (6.1.7) and the fact that ®), 0,7 = 0:7’. 
Now consider the vector fields 
X':=dY, Ye HVE, (6.1.9) 
along 7’. They satisfy the equations 


Vix! = ViOxy = Vi Or! = VA (D5 CE) = So (One Bi + CY) 
i=l i=1 
and 
WIV! = VIVE] — VAVIE! = B(O/ Ox) 
To sum up we have X’(A,0) = Y/(A,0) = 0 and 


m 


Vix’ = S(O HB +EY/), VY) = RO, Onx7)E}. (6.1.10) 
i=1 
On the other hand, the vector fields 


Xx! = POY, A —_— ® WE; (6.1.11) 


along 7’ satisfy the same equations, namely 


ViX! = ®)VOxy = OVA = O Vy (= ce 


t=1 
= ®) os (eB; + LV E;) = > (OnE; + EY/) , 
1=1 r= 1, 


ViY/ = ®)(ViVa Ei — WVL;) = ©) RY, Avy) Ei 
= R'(®) dy, ®, 0.7) OE; = R’(Op7/, X")E;. 


Here the last but one equation follows from (iv). 
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Since the tuples (6.1.9) and (6.1.11) satisfy the same differential equa- 


tion (6.1.10) and vanish at t = 0 they must agree. Hence 


Oy = ©)Oy7, \ Ei, = ®VE; 


fori =1,...,m. This says that \ +> (®)(t), ya(t), ¥\(t)) is a development. 
For t = 1 we obtain O0)7¥(A, 1) = 0 as required. 

Now the modified theorem (where ¢ is a local isometry) is proved. The 
original theorem follows immediately. Condition (iv) is symmetric in M 
and M’. Thus, if we assume (iv), there are local isometries ¢ : M — M’ 
and w : M' > M satisfying ¢(p0) = pop, dé(po) = Bo and v(p5) = po, 
dib(p’) = 51. But then yo ¢ is a local isometry with ~ o ¢(po) = po and 
d(wo d)(po) = id. Hence yo ¢ is the identity. Similarly ¢o w is the identity 
so ¢ is bijective (and 7 = ¢~') as required. This proves Theorem 6.1.7. 


Remark 6.1.12. The proof of Theorem 6.1.7 shows that the various im- 
plications in the weak version of the theorem (where ¢ is only a local 
isometry) require the following conditions on M and M’: 


(i) always implies (ii), (iii), and (iv); 
(ii) implies (iii) whenever M’ is complete; 
(iii) implies (i) whenever M’ is complete and M is connected; 


(iv) implies (iii) whenever M’ is complete and M is simply connected. 


Remark 6.1.13. The proof that (iii) implies (i) in Theorem 6.1.7 can be 
slightly shortened by using the following observation. Let ¢ : MM’ be 
a map between smooth manifolds. Assume that poy is smooth for every 
smooth curve y: [0,1] + M. Then ¢ is smooth. 


Corollary 6.1.14. Let M and M' be nonempty, connected, simply con- 
nected, complete Riemannian manifolds and let 6: M — M’ be a local 
isometry. Then @ is bijective and hence is an isometry. 


Proof. This follows by combining the weak and strong versions of the global 
C-A-H Theorem 6.1.7. Let po € M and define pp := (po) and ®o := dd(po). 
Then the tuple M, M’ , Po; Pp, Po satisfies condition (i) of the weak version 
of Theorem 6.1.7. Hence this tuple also satisfies condition (iv) of Theo- 
rem 6.1.7. Since M and M’ are connected, simply connected, and complete 
we may apply the strong version of Theorem 6.1.7 to obtain an isometry 
w:M — M' satisfying (po) = pp and dy(po) = ®o. Since every isometry 
is also a local isometry and M is connected it follows from Lemma 5.1.10 
that ¢(p) = w(p) for all p € M. Hence ¢ is an isometry, as required. 
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Remark 6.1.15. Refining the argument in the proof of Corollary 6.1.14 one 
can show that a local isometry ¢ : M — M’ must be surjective whenever 
is complete and M’ is connected. None of these assumptions can be removed. 
(Take an isometric embedding of a disc in the plane or an embedding of 
a complete space M into a space with two components, one of which is 
isometric to M.) 

Likewise, one can show that a local isometry ¢ : M — M’ must be 
injective whenever M is complete and connected and M’ is simply connected. 
Again none of these asumptions can be removed. (Take a covering R + S", 
or a covering of a disjoint union of two isometric complete simply connected 
spaces onto one copy of this space, or some noninjective immersion of a disc 
into the plane and choose the pullback metric on the disc.) 


6.1.3. The Local C-A-H Theorem 


Theorem 6.1.16 (Local C-A-H Theorem). Let M and M' be smooth 
m-mantfolds, let pp € M and py € M', and let ®o : T,,M — Ty M" be an 
orthogonal linear isomorphism. Let r > 0 be smaller than the injectvity radii 
of M at po and of M' at pp and define 

U, :={p € M|d(po,p) <r}, Uy := {p' © M’|d'(pp,p') <r}. 
Then the following are equivalent. 
(i) There exists an isometry ¢: U, + U}. satisfying (6.1.1). 
(ii) If (®, y,7) is a development on an interval I C R with 0 € I, satisfying 
the initial condition (6.1.2) as well as y(I) C U, and 7/(1) C US, then 


7(1) = Po = Y'(1) =p, B(1) = Bo. 
(iii) If (®0, yo, 9%) and (®1,71, 7%) are developments as in (ii), then 
yo(1) = 11 (1) = yo(1) = 711). 


(iv) [fv € T,,M with |v| <r and 
y(t) = exp,, (tv), ¥(é) := eXPyy (t®ov), (t) := O1,(t,0)Go®,(0,t), 
then ®(t),. Ry) = Rot) forO0<t<1. 
If these equivalent conditions are satisfied, then 
b(exp,,(v)) = expyy (®ov) 
for allu€ TM with |v| <r. 


Proof. See page 266. 


266 CHAPTER 6. GEOMETRY AND TOPOLOGY 
Lemma 6.1.17. Letp¢ M andv,w €T,M. For0<t <1 define 


0 
y(t) := exp(tv), Xe) 2= Dr 7 exp, (t(v + Aw)) € Ty) M. 
Then 
ViVX +R(X,5)¥=0, X(0)=0, YX(0)=w. (6.1.12) 


A vector field along y satisfying the first equation in (6.1.12) is called a 
Jacobi field along ¥. 


Proof. Define 
TA, t) = exp, (t(v + Aw)), ALA, t) = OVA, t) 


for all X and t. Since y(A,0) = p for all A we have X(A,0) = 0 and 
d 
Moreover, V0;7 = 0 and hence 


ViViX = VMMOAY 
= VVyvr7 - VV 
= R(HvY,AY)nY 
= R(vY,X)%7- 


This proves Lemma 6.1.17. 


Proof of Theorem 6.1.16. The proofs (i) ==> (ii) => (iii) => (i) = (iv) 

are as before; the reader might note that when L(y) < r we also have 

L(9’) < r for any development so that there are plenty of developments 

with y : [0,1] — U, and 7 : [0,1] — U!. The proof that (iv) implies (i) is 

a little different since (iv) here is somewhat weaker than (iv) of the global 

theorem: the equation ®,R = R’ is only assumed for certain developments. 
Hence assume (iv) and define ¢ : U, — U}. by 


o:= expiy oP) 0 EXP, :U, + Uf. 


We must prove that @ is an isometry. Thus we fix a point gq € U, and a 
tangent vector u € TyM and choose v,w € T,M with |v| < r such that 


EXP po (v) = q; dexPy, (v)w =U. (6.1.13) 


6.1. THE CARTAN-AMBROSE-HICKS THEOREM 267 
Define ¥ : [0,1] > U;,, 7 : [0,1] > Us, X © Vect(y), and X’ € Vect(7’) by 


7(t) = exp,, (tv), At) = exp,, (¢(v + Aw)) 


AX, 


q(t) = exp) (t®pv), X(t) := exphy (t(@ov + APow)). 


A=0 


an 
Then, by definition of ¢, we have 
y! := $07, dd(y)X = X’. (6.1.14) 
By Lemma 6.1.17, X is a solution of (6.1.12) and X’ is a solution of 
ViViX! = RO, XO’, =X(A,0) =0, VYX(A,0) = @ow. (6.1.15) 
Now define ®(¢) : T)4)M — Ty) M’ by 
P(t) := &,,(t, 0) o®, (0, ¢). 


Then © intertwines covariant differentiation. Since 7 and 7’ are parallel 
vector fields with +/(0) = ®pv = ®(0)4(0), we have 


for every t. Moreover, it follows from (iv) that ®,R, = Ri. Combining this 
with (6.1.12) we obtain 


ViVi(OX) = OV.VX = R'(by, BX) &4 = R'(4/, OX)¥. 


Hence the vector field ®X along y also satisfies the initial value prob- 
lem (6.1.15) and thus 


BX = X' = do(7)X. 
Here we have also used (6.1.14). Using (6.1.13) we find 


y(1) =exp,,(v)=9q,  X(1) = dexp,,(v)w =u, 
and so 
do(qju = db(y(1))X(1) = X"(1) = &(1)u. 
Since ®(1) : Tq) M > T,1)M’ is an orthogonal transformation this gives 


|de(q)ul = |@()ul = ful. 


Hence ¢ is an isometry as claimed. This proves Theorem 6.1.16. 
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6.2 Flat Spaces 


Our aim in the next few sections is to give applications of the Cartan- 
Ambrose-Hicks Theorem. It is clear that the hypothesis ®,R = R’ for all 
developments will be difficult to verify without drastic hypotheses on the 
curvature. The most drastic such hypothesis is that the curvature vanishes 
identically. 


Definition 6.2.1. A Riemannian manifold M is called flat iff the Riemann 
curvature tensor R vanishes identically. 


Theorem 6.2.2. Let MC R” be a smooth m-manifold. 


(i) M is flat if and only if every point has a neighborhood which is isometric 
to an open subset of R™, i.e. at each point p € M there exist local coordinates 
x',...,2™ such that the coordinate vectorfields E; = 0/0x' are orthonormal. 


(ii) Assume M is connected, simply connected, and complete. Then M is 
flat if and only if there is an isometry 6: M — R™ onto Euclidean space. 


Proof. Assertion (i) follows immediately from Theorem 6.1.16 and (ii) fol- 


lows immediately from Theorem 6.1.7. 


Exercise 6.2.3. Carry over the Cartan—Ambrose—Hicks theorem and The- 
orem 6.2.2 to the intrinsic setting. 


Exercise 6.2.4. A one-dimensional manifold is always flat. 
Exercise 6.2.5. If M, and Mp are flat, so is M = My x Mo. 
Example 6.2.6. By Exercises 6.2.4 and 6.2.5 the standard torus 

se {e= (Pigssaveye) ec” | a= = |aanl= 1} 
is flat. 


Exercise 6.2.7. For a,b > 0 and c > 0 define M Cc C? by 


M := M(a,b,c) := { (u,v, w) E c] lu| =a, |v) = 0b, w on}. 
Then M is diffeomorphic to a torus (a product of two circles) and M is flat. If 
a’,b' > Oand dc > 0, prove that there is an isometry ¢ from M’ = M(a’,b’,c’) 
to M = M(a, b,c) if and only if the triples (a’, b’, c’) and (a,b,c) are related 
by a permutation. 
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Hint: Show first that an isometry ¢: M’ > M that satisfies the condi- 
tion ¢(a’,b',c’) = (a,b,c) must have the form 


u\% fy'\F8 okey 5 Ti VonE ar ar B+6 
wdc (0(3) (F) (3) @) eZ) G) 
for integers a, 3,7, 6 that satisfy ad — By = +1. Show that this map ¢ is an 
isometry if and only if 


+0? =a? a? +7? +(a+7)?e?, 


c” = aBa? + y5b? + (a+ 7)(B + d)c?, 
p? ae re = Ba? as §2b2 a (B + 6)%c?. 


Exercise 6.2.8 (Developable manifolds). Let n = m-+1 and let E(t) be 
a one-parameter family of hyperplanes in R”. Then there exists a smooth 
map u: R—- R” such that 


E(t)=u(t)t, — u(t) =1, (6.2.1) 


for every t. We assume that u(t) 4 0 for every t so that u(t) and u(t) are 
linearly independent. Show that 


L(t) =a) 04+ = lim E(t) E(s). (6.2.2) 


Thus L(t) is a linear subspace of dimension m — 1. Now let y : R > R” be 
a smooth map such that 


(y(t), u(t)) =0, (Y(t), W(t) #0 (6.2.3) 
for all t. This means that 7(t) € E(t) and ¥(t) ¢ L(t); thus E(t) is spanned 
by L(t) and ¥(t). For t € R and e > 0 define 

L(t). := {vu € L(t) | |v] < e}. 


Let J Cc R be a bounded open interval such that the restriction of y to the 
closure of I is injective. Prove that, for ¢ > 0 sufficiently small, the set 


My = (r+ LWe) 
tel 


is a smooth manifold of dimension m = n — 1. A manifold which arises this 
way is called developable. Show that the tangent spaces of Mp are the 
original subspaces E(t), i.e. 


TpMo = E(t) for pe y(t) + Lt). 
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(One therefore calls Mo the “envelope” of the hyperplanes y(t) + E(t).) 
Show that Mo is flat. (Hint: use Gau8-Codazzi.) If (®,7,7’) is a de- 
velopment of Mo along R’™, show that the map ¢ : Mo — R™, defined 
by 

o(y(t) + v) = 7) + Bu 


for v € L(t)-, is an isometry onto an open set Mj C R™. Thus a development 
“unrolls” Mop onto the Euclidean space R™. When n = 3 and m = 2 one 
can visualize Mp as a twisted sheet of paper (see Figure 6.3). 


Figure 6.3: Developable surfaces. 


Remark 6.2.9. Given a codimension-1 submanifold 
M Cc R@™1 


and acurve y : R + M we may form the osculating developable Mp to M 
along y by taking 
Ee) = Ty) M. 


This developable has common affine tangent spaces with M along y as 
Ty) Mo =f) = 7 


for every t. This gives a nice interpretation of parallel transport: Mp may be 
unrolled onto a hyperplane where parallel transport has an obvious meaning 
and the identification of the tangent spaces thereby defines parallel transport 
in M. (See Remark 3.5.16.) 


Exercise 6.2.10. Each of the following is a developable surface in R?. 


(i) A cone on a plane curve ['c H, ice. 
M ={tp+(1—-2)q|t>0,¢eET} 


where H C R° is an affine hyperplane, p € R°\H, andIT C H isa 1-manifold. 
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(ii) A cylinder on a plane curve [, i.e. 
M={q+tv|qeET,teR} 


where H and [ are as in (i) and v is a fixed vector not parallel to H. (This 
is a cone with the cone point p at infinity.) 


(iii) The tangent developable to a space curve y: R > R?°, ice. 
M = {y(t) + 8¥(t)| |t— to] <¢,0< 8 <e}, 


where *+(to) and 4(to) are linearly independent and ¢ > 0 is sufficiently small. 
(iv) The paper model of a Mobius strip (see Figure 6.3). 


ae: 
\ / 
\ { 
K aes | 
_——-, 
8 


Figure 6.4: A circular one-sheeted hyperboloid. 


Remark 6.2.11. A 2-dimensional submanifold M C R? is called a ruled 
surface iff there is a straight line in M through every point. Every devel- 
opable surface is ruled, however, there are ruled surfaces that are not devel- 
opable. An example is the manifold M = {y(t) + s¥(t) | |t — to] < ¢, |s| < e} 
where 7 : R > R? is a smooth curve with |4| = 1 and 4(to) 4 0, and e > 0 is 
sufficiently small; this surface is not developable in general. Other examples 
are the elliptic hyperboloid of one sheet 


2 
M := (ov z) €R° = raed aa — i} (6.2.4) 


depicted in Figure 6.4, the hyperbolic paraboloid 


oy? 
Mim { (aya) ER = 5-4}. (6.2.5) 
a 


(both with two straight lines through every point in M), Pliicker’s conoid 


Ms= {(ay2) ERS 


2xry 
2 2 
0,z= 6.2.6 
Pp 4£0,2= so} (6.2.6) 
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the helicoid 


M := {tm eR 


an wv} (6.2.7) 


Vere 


and the Mobius strip 


cos(s) cos(s/2) cos(s) and 
MS sin(s) }| += | cos(s/2)sin(s) ana (6.2.8) 
0 sin(s/2) Soe 


These five surfaces have negative Gaufian curvature. The Mobius strip 
in (6.2.8) is not developable, while the paper model of the Mobius strip is. 
The helicoid in (6.2.7) is a minimal surface, i.e. its mean curvature (the 
trace of the second fundamental form) vanishes. A minimal surface which 
is not ruled is the catenoid 


M := {(2,y,2) € R®|2? + y? = c? cosh (z/c)}. 


(Exercise: Prove all this.) 


6.3. Symmetric Spaces 


In the last section we applied the Cartan-Ambrose-Hicks Theorem in the flat 
case; the hypothesis ®, R = R’ was easy to verify since both sides vanish. To 
find more general situations where we can verify this hypothesis note that 
for any development (®, 7,7’) satisfying the initial conditions 


7(0) = Po; (0) = Po. #(0) = Go, 
we have 
O(t) = O1,(t, 0) &o%,(0, t) 
so that the hypothesis ®,R = R’ is certainly implied by the three hypotheses 
©,(t,0)xRp = Rye) 
! / _ / 
Ds (t, 0).Ry = Rot) 
_ / 
(®o)-Ryy = Rey. 
The last hypothesis is a condition on the initial linear isomorphism 
®):7,,M —> Ty, M’ 
while the former hypotheses are conditions on M and M’ respectively, 
namely, that the Riemann curvature tensor is invariant by parallel trans- 


port. It is rather amazing that this condition is equivalent to a simple 
geometric condition as we now show. 
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6.3.1 Symmetric Spaces 


Definition 6.3.1. A Riemannian manifold M is called symmetric about 
the point p € M iff there exists a (necessarily unique) isometry 


¢:M3M 


satisfying 
d(p)=p, ddb(p) = —id; (6.3.1) 


M is called asymmetric space iff it is symmetric about each of its points. 
A Riemannian manifold M is called locally symmetric about the point 
peEM “a, forr > 0 sufficiently small, there exists an isometry 


$: U,(p, M) — U,(p, M), U-(p,M) := {q € M |d(p,q) <r}, 


satisfying (6.3.1); M is called a locally symmetric space iff it is locally 
symmetric about each of its points. 


Remark 6.3.2. The proof of Theorem 6.3.4 below will show that, if M is 


locally symmetric, the isometry ¢ : U,(p,M) > U,(p,M) with ¢(p) = p 
and d@(p) = —id exists whenever 0 < r < inj(p). 


Exercise 6.3.3. Every symmetric space is complete. Hint: If y: I — M is 
a geodesic and ¢: M — M is asymmetry about the point 7(to) for to € J, 
then $(7(to + t)) = y(to —t) for all t € R with to+¢,to —t eI. 


Theorem 6.3.4. Let M C R” be an m-dimensional submanifold. Then the 
following are equivalent. 


(i) M is locally symmetric. 


(ii) The covariant derivative VR (defined below) vanishes identically, i.e. 
(WR)p(v1, v2)w = 0 


for allpe M and v,v1,v2,w € TpM. 


(iii) The curvature tensor R is invariant under parallel transport, i.e. 
®,(t, S)+Ry(s) = Ry) (6.3.2) 


for every smooth curve y:R—- M and all s,t € R. 


Proof. See page 277. 
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Corollary 6.3.5. Let M and M’ be locally symmetric spaces and fiz two 
points po © M and py € M’, and let ®o : T,,M > Ty, M! be an orthogonal 
linear isomorphism. Let r > 0 be less than the injectivity radius of M at po 
and the injectivity radius of M' at py. Then the following holds. 

(i) There exists an isometry ¢: U,(po,M) > U;(po, M’) with o(po) = po 
and dd(po) = ®o if and only if ®o intertwines R and R’, i.e. 


(©0)«Rpo = Rip. (6.3.3) 


(ii) Assume M and M' are connected, simply connected, and complete. 
Then there exists an isometry 6: M — M' with 6(po) = pp and dd(po) = ®o 
if and only if ®o satisfies (6.3.3). 


Proof. In (i) and (ii) the “only if” statement follows from Theorem 5.3.1 
(Theorema Egregium) with &9 := d¢(po). To prove the “if” statement, let 
(®, 7, 7’) be a development satisfying y(0) = po, y'(0) = pp, and ®(0) = Go. 
Since R and R’ are invariant under parallel transport, by Theorem 6.3.4, it 
follows from the discussion in the beginning of this section that ®,R = R’. 
Hence assertion (i) follows from the local C-A-H Theorem 6.1.16 and (ii) 
follows from the global C-A-H Theorem 6.1.7 


Corollary 6.3.6. A connected, simply connected, complete, locally symmet- 
ric space 1s symmetric. 


Proof. Corollary 6.3.5 (ii) with M’ = M, pj = po, and ®p = —id. 


Corollary 6.3.7. A connected symmetric space M is homogeneous; i.e. 
given p,q © M there exists an isometry 6: M > M with o(p) = ¢. 


Proof. If M is simply connected, the assertion follows from part (ii) of Corol- 
lary 6.3.5 with M = M’, po = p, pp =, and ® = ©,(1,0):7,M > TM, 
where 7 : [0,1] > M is a curve from p to q. If M is not simply connected, 
we can argue as follows. There is an equivalence relation on M defined by 


p~q :<—  disometry ¢: M > M 32 d(p) =4¢. 


Let p,q € M and suppose that d(p,q) < inj(p). By Theorem 4.4.4 there 
is a unique shortest geodesic y : [0,1] + M connecting p to q. Since M is 
symmetric there is an isometry ¢: M — M such that ¢(7(1/2)) = y(1/2) 
and d¢(7(1/2)) = —id. This isometry satisfies @(y(t)) = y(1 — t) and hence 
o(p) = gq. Thus p ~ q whenever d(p,q) < inj(p). This shows that each 
equivalence class is open, hence each equivalence class is also closed, and 
hence there is only one equivalence class because M is connected. This 


proves Corollary 6.3.7. 
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6.3.2 Covariant Derivative of the Curvature 


For two vector spaces V,W and an integer k > 1 we denote by L*(V,W) 
the vector space of all multi-linear maps from V* = V x --- x V to W. 
Thus £1(V,W) = L(V,W) is the space of all linear maps from V to W. 


Definition 6.3.8. The covariant derivative of the Riemann curvature 
tensor assigns to every p € M a linear map 


(VR)p : TpM —> £L2(TpM, L(TpM, TypM)) 
such that 


(VR)(X)(X1, X2)¥ = Vx (R(X1, X2)¥) — R(VxX1, X2)¥ 
— R(X1, Vx Xe)¥ — R(X1, X2)VxY 


(6.3.4) 
for all X, X1, X2,Y € Vect(M). We also use the notation 
(WR)p = (VR)p(v) 

for p € M and v € T,M so that 

(Vx R)(X1, X2)¥ := (VR)(X)(X1, X2)¥ 
for all X,X1, Xo, Y € Vect(M). 
Remark 6.3.9. One verifies easily that the map 

Vect(M)* — Vect(M) : (X,X1,X2,Y) 4 (Vx R)(X1, X9)Y, 


defined by the right hand side of equation (6.3.4), is multi-linear over the 
ring of functions ¥(M). Hence it follows as in Remark 5.2.12 that VR is 
well defined, i.e. that the right hand side of (6.3.4) at p € M depends only 


on the tangent vectors X(p), X1(p), X2(p), Y (p). 


Remark 6.3.10. Let y : J + M be a smooth curve on an interval J C R 
and 
X1, X,Y € Vect(7) 


be smooth vector fields along 7. Then equation (6.3.4) continues to hold 
with X replaced by ¥ and each Vx on the right hand side replaced by the 
covariant derivative of the respective vector field along y: 


(Vi R)(X1, Xa)¥ = V(R(X1, X2)¥) — R(V KX, X2)¥ 


(6.3.5) 
SRG oY RUG ove 
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Theorem 6.3.11. (i) Ify:R— M is a smooth curve such that y(0) = p 
and +(0) = v, then 
d 
(VA); = 7 = &, (0, t) Rye) (6.3.6) 


(ii) The covariant derivative of the Riemann curvature tensor satisfies the 
second Bianchi identity 


(Vx R)(Y, Z) + (WR)(Z, X) + (VzR)(X,Y) = 0. (6.3.7) 


Proof. We prove (i). Let v1,v2,w € T,M and choose parallel vector fields 
X1,X2,Y € Vect(y) along + satisfying the initial conditions X1(0) = v4, 
X2(0) = v2, Y(0) = w. Thus 


X1(t) = ®,(t,0)v1, X2(t) = &,(t,0)ve, Y(t) = ®,(t,0)w. 
Then the last three terms on the right vanish in equation (6.3.5) and hence 
(WW R)(v1, v2)w = V(R(X1, X2)Y)(0) 


d 
= Fl Prom Ryn), X2)¥ O) 
d 
dt t=0 (0) Ryn (®t, Oar, @,(t,0)v2)®,(t, 0)w 
d 
~ at (©, (0, t)»Ry(1)) (v1, v2)w. 
t=0 


Here the second equation follows from Theorem 3.3.6. This proves (i). 
We prove (ii). Choose a smooth function y : R® — M and denote by 
(r,s,t) the coordinates on R°. If Y is a vector field along 7, we have 
(Va,7R)(0s7; Ovy)Y = Vr (R(0s7, AY) _ R(0s7, OY) VrY 
—R(V,0sy, #Y)Y — R(Osy, V-RV)Y 
= V(VUY — VY) — (ViVi — WV) VY 
+R(Or7, VrOsy)Y > R(Os7; ViOry)Y. 
Permuting the variables r, s,¢ cyclically and taking the sum of the resulting 
three equations we obtain 
(Va,+-R) (Os, OvY)Y ot (Va,+R) (AY; Ory) ¥ TF (Va,7R) (On, Osy)Y 
= Vi (VsMY — ViVsY) — (VeVi — ViVs) Vr¥ 
+V, (ViV-Y — V-VY) — (VV, — VeV4) Va¥ 
+V; (V-VeY _ Va Vr¥ ) _ (V,Vs _ Vs Vr) MY. 


The terms on the right cancel out. This proves Theorem 6.3.11. 
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Proof of Theorem 6.3.4. We prove that (iii) implies (i). This follows from 
the local Cartan—Ambrose—Hicks Theorem 6.1.16 with 


Po=Po=p, Gy =—id: TM > TpM. 

This isomorphism satisfies 
(G9).y = Rp. 
Hence it follows from the discussion in the beginning of this section that 
6,R = RP’ 
for every development (®,7, 7’) of M along itself satisfying 
10) =O) =p, — ®(0) = —id. 
Hence, by the local C-A-H Theorem 6.1.16, there is an isometry 
¢:U,(p,M) > U;(p, M) 


satisfying 
o(p)=p, — dd(p) = —id 
whenever 0 < r < inj(p; M). 

We prove that (i) implies (ii). By Theorem 5.3.1 (Theorema Egregium), 
every isometry ¢ : M — M"’ preserves the Riemann curvature tensor and 
covariant differentiation, and hence also the covariant derivative of the Rie- 
mann curvature tensor, i.e. 


ox(VR) = V'R'. 
Applying this to the local isometry ¢ : U,(p, M) > U;,(p, M) we obtain 
(Vag(pyv2?) o(p) APP) v1, dbp) v2) = de(p) (WR) (v1, v2)d6(p) | 
for all v,v1,v2 € T,M. Since 
do(p) = —id 


this shows that VR vanishes at p. 
We prove that (ii) imlies (iii). If VR vanishes, then equation (6.3.6) in 
Theorem 6.3.11 shows that the function 


st ®(t, 5) + Rey(s) = ©,(t,0),®,(0, 5) x Fy(s) 


is constant and hence is everywhere equal to Ry). This implies (6.3.2) and 
completes the proof of Theorem 6.3.4. 
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Covariant Derivative of the Curvature in Local Coordinates 


Let 6: U > Q be a local coordinate chart on M with values in an open set 
Q Cc R™, denote its inverse by ~ := ¢7!:Q > U, and let 


3) 
EAg) = ve r) € Ty2)M rEeQ, 2 ree 


be the local frame of the tangent bundle determined by this coordinate 
chart. Let ry. : Q — R denote the Christoffel symbols and Rije :Q—>R the 
coefficients of the Riemann curvature tensor so that 


iEy = OV En, — R(Bi, Ej) Ex = D0 Rijn Ee. 
£ 


Given i,j,k, @ € {1,...,m} we can express the vector field (Vz, R)(£;, Ex) Ex 
along ~ for each x € 2 as a linear combination of the basis vectors E;(z) 
This gives rise to functions 


ViRine: 24R 


defined by 
(Ve, R)(E;, Ex) Ee = Da ne (6.3.8) 


These functions are given by 


ViR Ge = ORF Ke + py Ti Ripe 


(6.3.9) 
a yy Le jake — py Tig ty, jue > v7, Ie 
LU 
The second Bianchi identity has the form 
V,RY Tke + V; 7 Rise + ViRije = = 0. (6.3.10) 


Exercise: Prove equations (6.3.9) and (6.3.10). Warning: As in §5.4, 
care must be taken with the ordering of the indices. Some authors use the 
notation ViRein for what we call ViR ine: 


6.3.3 Examples and Exercises 
Example 6.3.12. Every flat manifold is locally symmetric. 


Example 6.3.13. If MM, and M) are (locally) symmetric, so is M, x Mo. 
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Example 6.3.14. M = R” with the standard metric is a symmetric space. 
Recall that the isometry group Z(R™) consists of all affine transformations 


of the form 
o(a) = Ax +b, AéO(m), be R”. 


(See Exercise 5.1.4.) The isometry with fixed point p € R™ and d¢(p) = —id 
is given by $(x) = 2p — x for x € R™. 


Example 6.3.15. The flat tori of Exercise 6.2.7 in the previous section are 
symmetric (but not simply connected). This shows that the hypothesis of 
simple connectivity cannot be dropped in part (ii) of Corollary 6.3.5. 


Example 6.3.16. Below we define manifolds of constant curvature and 
show that they are locally symmetric. The simplest example, after a flat 
space, is the unit sphere S” = {2 € R™*!| || = 1}. The symmetry ¢ of 
the sphere about a point p € M is given by 


(x) == —# + 2(p, x) p 


for x € S™. This extends to an orthogonal linear transformation of the 
ambient space. In fact the group of isometries of S$” is the group O(m + 1) 
of orthogonal linear transformations of R’*! (see Example 6.4.16 below). 
In accordance with Corollary 6.3.7 this group acts transitively on S™. 


Example 6.3.17. A compact two-dimensional manifold of constant neg- 
ative curvature is locally symmetric (as its universal cover is symmetric) 
but not homogeneous (as closed geodesics of a given period are isolated). 
Hence it is not symmetric. This shows that the hypothesis that M be simply 
connected cannot be dropped in Corollary 6.3.6. 


Example 6.3.18. The real projective space RP” with the metric inherited 
from S” is a symmetric space and the orthogonal group O(n+1) acts on it by 
isometries. The complex projective space CP” with the Fubini-Study metric 
in Example 3.7.5 is a symmetric space and the unitary group U(n + 1) acts 
on it by isometries. The complex Grassmannian G;(C") in Example 3.7.6 
is a symmetric space and the unitary group U(n) acts on it by isometries. 
(Exercise: Prove this.) 


Example 6.3.19. The simplest example of a symmetric space which is not 
of constant curvature is the orthogonal group O(n) = { gE R™*" | gl g= 1} 
with the Riemannian metric (5.2.19) of Example 5.2.17. The symmetry ¢ 
about the point a € O(n) is given by ¢(g) = ag~!a. This discussion extends 
to every Lie subgroup G C O(n). (Exercise: Prove this.) 
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6.4 Constant Curvature 


In the §5.3 we saw that the GauSian curvature of a two-dimensional surface 
is intrinsic: we gave a formula for it in terms of the Riemann curvature 
tensor and the first fundamental form. We may use this formula to define the 
Gaufian curvature for any two-dimensional manifold (even if its codimension 
is greater than one). We make a slightly more general definition. 


6.4.1 Sectional Curvature 


Definition 6.4.1. Let M CR” be a smooth m-dimensional submanifold. 
Letp€ M and let E C TM be a 2-dimensional linear subspace of the tan- 
gent space. The sectional curvature of M at (p, E) is the number 


(Rp(u, vv, u) 


K(p, FE) = 
a aa a 


(6.4.1) 


where u,v € E are linearly independent (and hence form a basis of EF). 


The right hand side of (6.4.1) remains unchanged if we multiply u or v 
by a nonzero real number or add to one of the vectors a real multiple of the 
other; hence it depends only on the linear subspace spanned by wu ad v. 


Example 6.4.2. If M C R® is a 2-manifold, then by Theorem 5.3.9 the 
sectional curvature K(p,T,M) = K(p) is the Gaufian curvature of M at p. 
More generally, for any 2-manifold M Cc R” (whether or not it has codimen- 
sion one) we define the GauBian curvature of M at p by 


Kip) = Kp, 75M). (6.4.2) 


Example 6.4.3. If M Cc R'™*! is a submanifold of codimension one and 
vy: M—- S™ is a Gau8 map, then the sectional curvature of a 2-dimensional 
subspace E’ C J;,M spanned by two linearly independent tangent vectors 
u,v € T,M is given by 


(u, dv(p)u)(v, dv(p)v) — (u, dv(p)v)? 
|ul?|ul? — (u,v)? ) 


K(p, E) = (6.4.3) 


This follows from equation (5.3.8) in the proof of Theorem 5.3.9 which holds 
in all dimensions. In particular, when M = S™, we have v(p) = p and hence 
K(p, £) = 1 for all p and E. For a sphere of radius r we have v(p) = p/r 


and hence K(p, E) = 1/r?. 


6.4. CONSTANT CURVATURE 281 


Example 6.4.4. Let G C O(n) be a Lie subgroup equipped with the Rie- 
mannian metric 

(v, w) := trace(v'w) 
for v,w € TyG C R"*”. Then, by Example 5.2.17, the sectional curvature 
of G at the identity matrix 1 is given by 


K(1, B) = Fm? 


for every 2-dimensional linear subspace EF C g = Lie(G) = 7TyG with an 
orthonormal basis €, 77. 


Exercise 6.4.5. Let FE C T,M be a 2-dimensional linear subspace, let r > 0 
be smaller than the injectivity radius of M at p, and let N C M be the 2- 
dimensional submanifold given by 


N := exp, ({v € B| |v| < r}). 


Show that the sectional curvature K(p, FE) of M at (p, £) agrees with the 
Gaufian curvature of N at p. 


Exercise 6.4.6. Let p ¢ M C R” and let EF C TM be a 2-dimensional lin- 
ear subspace. For r > 0 let LZ denote the ball of radius r in the (n — m + 2)- 


dimensional affine subspace of R” through p and parallel to the vector sub- 
space H+ TM 


L={p tu+wlveéb,weTpM, ju? + wl? <r? h. 


Show that, for r sufficiently small, 21 M is a 2-dimensional manifold with 
Gaufian curvature Ky7is(p) at p given by Kram (p) = K(p, £). 


6.4.2 Constant Sectional Curvature 


Definition 6.4.7. Let k € R and m > 2 be an integer. An m-manifold 
M CR” is said to have constant sectional curvature k iff K(p, FE) =k 
for every p€ M and every 2-dimensional linear subspace E C T,M. 


Theorem 6.4.8. Let M CR” be an m-manifold and fiz an element p € M 
and a real number k. Then the following are equivalent. 


(i) K(p, E) =k for every 2-dimensional linear subspace E C T,M. 


(ii) The Riemann curvature tensor of M at p is given by 


(Rp(v1, V2)U3, U4) =k ((w, v4) (v2, v3) = (v1, v3) (v2, v4)) (6.4.4) 


for all v1, v2, v3, U4 € TM. 
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Proof. That (ii) implies (i) follows directly from the definition of the sec- 
tional curvature in (6.4.1) by taking v1 = v4 = u and v2 = v3 = v in (6.4.4). 
Conversely, assume (i) and define the multi-linear map Q : T, M+ > R by 


Q(v1, V2, v3, U4) = (Rp(v1, v2) v3, va) — k((v1, v4) (v2, v3) — (v1, U3) (V2, U4 )). 


Then, for all u,v, v1, v2, 03, va € TpM, the map Q satisfies the equations 


Q(v1, v2, V3, v4) + Q(v2, V1, V3, V4) = UF (6.4.5) 

Q(v1, v2, 03, V4) + Q(v2, v3, 01, V4) + Q(v3, V1, v2, v4) = 0, (6.4.6) 
Q(v1, v2, U3, V4) — Q(v3, V4, V1, 02) = 0, (6.4.7) 

Q(u,v,u, 2) = 0. (6.4.8) 


Here the first three equations follow from Theorem 5.2.13 and the last follows 
from the definition of Q and the hypothesis that the sectional curvature 
is K(p, E) =k for every 2-dimensional linear subspace E C T,M. 

We must prove that Q vanishes. Using (6.4.7) and (6.4.8) we find 


0 = Q(u, v1 + v2, U, V1 + V2) 
_ Q(u, v1, U, v2) + Q(u, v2, U, U1) 
= 2Q(u, V1, U, V2) 


for all u,v1,v2 € T,M. This implies 
0 = Q(ui + Ua, v1, U1 + U2, v2) 
= Q(u1, V1, U2, V2) + Q(u2, V1, U1, V2) 


for all uy, u2,v1, vo € TpM. Hence 


Q(v1, V2, 03, v4) = —Q(v3, v2, V1, V4) 
= Q(v2, V3, V1, U4) 


= —Q(v3, v1, v2, v4) _ Q(v1, v2, U3, U4). 


Here the second equation follows from (6.4.5) and the last from (6.4.6). Thus 


1 1 
Q(v1, V2, V3, U4) F — 5 @(v3, 1, ¥2, M4) — 2 (v1, U3, ¥2, M4) 


for all v1, v2, v3, va € TpM and, repeating this argument, 


1 
Q(v1, v2, 03, U4) = 12 (e1, V2, U3, U4). 


Hence Q = 0 as claimed. This proves Theorem 6.4.8. 
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Remark 6.4.9. The symmetric group $4 on four symbols acts naturally 
on the space £*(T,,M, R) of multi-linear maps from T,M*4 to R. The condi- 
tions (6.4.5), (6.4.6), (6.4.7), and (6.4.8) say that the four elements 


a = id+(12) 


ll 
aa 
1 o+ 


d = id+ (13) + (24) + (13)(24) 

of the group ring of S4 annihilate Q. This suggests an alternate proof of 
Theorem 6.4.8. A representation of a finite group is completely reducible 
so one can prove that Q = 0 by showing that any vector in any irreducible 
representation of $4 which is annihilated by the four elements a,b,c and 
d must necessarily be zero. This can be checked case by case for each 
irreducible representation. (The group Sy has 5 irreducible representations: 
two of dimension 1, two of dimension 3, and one of dimension 2.) 


If M and M’ are two m-dimensional manifolds with constant curvature k, 
then every orthogonal isomorphism © : T,M — TM’ intertwines the Rie- 
mann curvature tensors by Theorem 6.4.8. Hence by the appropriate version 
(local or global) of the C-A-H Theorem we have the following corollaries. 


Corollary 6.4.10. Every Riemannian manifold with constant sectional cur- 
vature is locally symmetric. 


Proof. Theorem 6.3.4 and Theorem 6.4.8. 


Corollary 6.4.11. Let M and M' be m-dimensional Riemannian manifolds 
with constant curvature k and let p € M and p' € M’. Ifr > 0 is smaller 
than the injectivity radii of M at p and of M' at p’, then for every orthogonal 
isomorphism 

@:7,M > T,M' 


there exists an isometry 
Q: U,(p, M) > U,(p', M’) 
such, that 


Proof. This follows from Corollary 6.3.5 and Corollary 6.4.10. Alternatively 
one can use Theorem 6.4.8 and the local C-A-H Theorem 6.1.16. 
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Corollary 6.4.12. Any two connected, simply connected, complete Rieman- 
nian manifolds with the same constant sectional curvature and the same 
dimension are isometric. 


Proof. Theorem 6.4.8 and the global C-A-H Theorem 6.1.7. 


Corollary 6.4.13. Let M C R” be a connected, simply connected, complete 
manifold. Then the following are equivalent. 


(i) M has constant sectional curvature. 


(ii) For every pair of points p,q © M and every orthogonal linear isomor- 
phism ®:T,M — T,M there exists an isometry ¢: M — M such that 


O(p) = 4, do(p) = ®. 


Proof. That (i) implies (ii) follows immediately from Theorem 6.4.8 and the 
global C-A-H Theorem 6.1.7. Conversely assume (ii). Then, for every pair 
of points p,q € M and every orthogonal linear isomorphism 


®:7,M > TM, 
it follows from Theorem 5.3.1 (Theorema Egregium) that 
®,Rp = Rg 


and so 
K(p, E) = K(q, ®E) 


for every 2-dimensional linear subspace E' C T,M. Since, for every pair of 
points p,q € M and of 2-dimensional linear subspaces 


ECT)M, FCM, 
we can find an orthogonal linear isomorphism ® : T,M — T,M such that 


SE =F, 


this implies (i). 


Corollary 6.4.13 asserts that a connected, simply connected, complete 
Riemannian m-manifold M has constant sectional curvature if and only if 
the isometry group Z(M) acts transitively on its orthonormal frame bun- 
dle O(M). Note that, by Lemma 5.1.10, this group action is also free. 
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Examples and Exercises 


Example 6.4.14. Any flat Riemannian manifold has constant sectional 
curvature k = 0. 


Example 6.4.15. The manifold 
M =kR™ 


with its standard metric is, up to isometry, the unique connected, simply 
connected, complete Riemannian m-manifold with constant sectional curva- 
ture 

k=0. 


Example 6.4.16. For m > 2 the unit sphere 
M= 5” 


with its standard metric is, up to isometry, the unique connected, simply 
connected, complete Riemannian m-manifold with constant sectional curva- 
ture 

k=1. 


Hence, by Corollary 6.4.12, every connected simply connected, complete 
Riemannian manifold with positive sectional curvature k = 1 is compact. 
Moreover, by Corollary 6.4.13, the isometry group Z(S”™) is isomorphic to 
the group O(m +1) of orthogonal linear transformations of R™+!. Thus, 
by Corollary 6.4.13, the orthonormal frame bundle O(S™) is diffeomorphic 
to O(m +1). This follows also from the fact that, if 


Uly+++5Um 
is an orthonormal basis of T,S™ = p+ then 


DP,U1,+-++5Um 
is an orthonormal basis of R™+!. 


Example 6.4.17. A product of spheres is not a space of constant sectional 
curvature, but it 7s a symmetric space. Exercise: Prove this. 


Example 6.4.18. For n > 4 the orthogonal group O(n) is not a space of 
constant sectional curvature, but it is a symmetric space and has nonnegative 
sectional curvature (see Example 6.4.4). 
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6.4.3. Hyperbolic Space 


Fix an integer m > 2. The hyperbolic space H”™ is, up to isometry, the 
unique connected, simply connected, complete Riemannian m-manifold with 
constant sectional curvature k = —1. A model for H’ can be constructed 
as follows. A point in R’”*! will be denoted by 


p= (z0,2); zo ER, & = (£1,...,2m) € R™. 
Let Q: R™*+! x R™*! —  R denote the symmetric bilinear form given by 
Q(p, ¢) := —Loyo + 21y1 +++ + LmYm (6.4.9) 
for p = (xo, 2),q = (yo, y) € R™*. Since Q is nondegenerate the space 
H” := {p = (xp, 2) € R™*" | Q(p,p) = -1, 20 > o} 


is a smooth m-dimensional submanifold of R’*! and the tangent space 
of H™ at p is given by 


0 = {v a ier! |Q(p, v) = 0} . 
For p = (ao, 2) € R™*! and v = (£9, €) € R™*! we have 


peEeH” <= xw=Vl14lal, 
(E,2) 


veT,H” — =~ 


Now let us define a Riemannian metric on H™ by 


Ip(v, w) = Q(v, w) 


= (,n) ~ (6.4.10) 
7 a SSM 7 
_ (€,7) io |x|? 


for v= (£0, €) € TpH™ and w = (70,7) € T,pH™. 


Theorem 6.4.19. HI” is a connected, simply connected, complete Rieman- 
nian m-manifold with constant sectional curvature k = —1. 


Proof. See page 287. 
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We remark that the manifold H’™ does not quite fit into the extrinsic 
framework of most of this book as it is not exhibited as a submanifold 
of Euclidean space but rather of “pseudo-Euclidean space”: the positive 
definite inner product (v,w) of the ambient space R’*! is replaced by a 
nondegenerate symmetric bilinear form Q(v,w). However, all the theory 
developed thus far goes through (reading Q(v,w) for (v,w)) provided we 
impose the additional hypothesis (true in the example M = Hi”) that the 
first fundamental form g, = Q\r, Mm is positive definite. For then Q\|z, Mm is 
nondegenerate and we may define the orthogonal projection II(p) onto T, 
as before. The next lemma summarizes the basic observations; the proof is 
an exercise in linear algebra. 


Lemma 6.4.20. Let Q be a symmetric bilinear form on a vector space V 
and for each subspace E of V define its orthogonal complement by 


E+@ := {we V|Q(u,v) =0 We E}. 


Assume Q is nondegenerate, i.e. Vt2 = {0}. Then, for every linear sub- 
space FF CV, we have 


V=EQE‘2 = En£E+e = {0}, 


i.e. E+@ is a vector space complement of E if and only if the restriction 
of Q to E is nondegenerate. 


Proof of Theorem 6.4.19. The proofs of the various properties of HI” are 
entirely analogous to the corresponding proofs for S™. Thus the unit normal 
field to H™ is given by 


U(p) = p 
for p € H™ although the “square of its length” is 
Q(p,p) = —1. 


For p € H’ we introduce the Q-orthogonal projection II(p) of R™*! onto 
the subspace 77H. It is characterized by the conditions 


I(p)? = II(p), ker H(p) Lg imII(p), imII(p) = T,H”, 
and is given by the explicit formula 
I(p)v =v + Ql, p)p 


for v € R™+, 
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The covariant derivative of a vector field X € Vect(y) along a smooth 
curve y: R > H”™ is given by 
VX(t) = I(y(t)) X (2) 
= X(t) + Q(X (4), 1) 1) 
= X(t) — Q(X), 1) 1). 


The last identity follows by differentiating the equation Q(X,7) = 0. This 
can be interpreted as the hyperbolic Gau$8—Weingarten formula as follows. 
For p € H™ and u € T,H”™ we introduce, as before, the second fundamental 
form 

hp(u) : TpH™ > (T,H™)+¢ 


via 
hp(u)v := (dII(p)u)v 
and denote its Q-adjoint by 
hy(u)* : (TpH™)+¢ > T,H™. 


For every p € R™+! we have 


(attiryu)e =F) (e+ Qn +e e+ tu) 
= Qu, p)u + Q(v, wp, 


where the first summand on the right is tangent to H and the second 
summand is @-orthogonal to T,H". Hence 


hp(u)v = Q(v, u)p, hp(u)*w = Q(u, p)u (6.4.11) 


for v € T,H" and w € (T,H™)+2. 
With this understood, the Gau$-Weingarten formula 


X=VX+hy(y)X 
extends to the present setting. The reader may verify that the operators 
V : Vect(y) > Vect(7) 


thus defined satisfy the axioms of Theorem 3.7.8 and hence define the Levi- 
Civita connection on H’”. 
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Now a smooth curve y : J > H"™ is a geodesic if and only if it satisfies 
the equivalent conditions 


Vy=0 = F)LoTwH” Vier = F¥=OQ(4,4)2. 


A geodesic must satisfy the equation 


@. cee yx die A 

Holy) = 204, 7) = 0 

because 4 is a scalar multiple of y, and hence Q(¥,7) is constant. Fix an 
element p € HI” and a tangent vector v € T,H’” such that 


Q(v,v) = 1. 
Then the geodesic 7 : R > H™ with 7(0) = p and ¥(0) = v is given by 
y(t) = cosh(t)p + sinh(t)v, (6.4.12) 

where 
eb+et eb—et 

9 o] 
In fact we have 7(t) = y(t) Le Tya)H'". It follows that the geodesics 
exist for all time and hence H"™ is geodesically complete. Moreover, being 
diffeomorphic to Euclidean space, H’ is connected and simply connected. 


It remains to prove that H™ has constant sectional curvature k = —1. 
To see this we use the GauB—Codazzi formula in the hyperbolic setting, i.e. 


cosh(t) := 


Ryluse) = hg)" Reo) = Tyg (ay hela): (6.4.13) 


By equation (6.4.11), this gives 


(Rp(u, v)v, u) = Q(hp(u)u, Rp(v)v) — Q(hp(v)u, hp(u)v) 
= Q(Q(u, u)p, Q(v, v)p) — Q(Q(u, v)p, Q(u, v)p) 
= —Q(u, u)Q(v, v) + Q(u, 0)? 

—Gp(U; U)Gp(v, v) + gp(u, v)? 


for all u,v € T,HI”. Hence, for every p € M and every 2-dimensional linear 
subspace E' C T,M with a basis u,v € E we have 
(Rp(u, v)v, u) 


= Cg tee 


This proves Theorem 6.4.19. 
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Exercise 6.4.21. Prove that the pullback of the metric on H under the 


diffeomorphism 
R” SH": 2H (v1 + eR, 2) 
is given by 
\2| _ |z/? Cpa 


x 


—1+4eP 
for « € R™ and & € R” = T,R™. Thus the metric tensor is given by 
Lily 
1+ |a/? 


93 (x) = dij — (6.4.14) 


for a = (F145 24,2) €E™. 
Exercise 6.4.22. The Poincaré model of hyperbolic space is the open 
unit disc D™ C R™ equipped with the Poincaré metric 
fal, = 
= 2 
ee |) 


for y € D™ and 7 € R™ = T,D™. Thus the metric tensor is given by 
4d;; 


(1- (uP?) 


Prove that the diffeomorphism 


2 
moon yes (Ll 2y 


9i5(Y) = ye D”. (6.4.15) 


1—|y|?? 1—|y/? 


is an isometry with the inverse 
x 
1+ ZO ; 


H” > D™: (20,2) 


Interpret this map as a stereographic projection from the south pole (—1,0). 


Exercise 6.4.23. The composition of the isometries in Exercise 6.4.21 and 
Exercise 6.4.22 is the diffeomorphism R™ — D" : x7 +> y given by 


x Qy 1+|y/? 
jo, ge. EP , 
J1+|zP2+1 1—|y/? 1—|y|? 


Prove that this is an isometry intertwining the Riemannian metrics (6.4.14) 
and (6.4.15). 
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Exercise 6.4.24. This exercise shows that every nonconstant geodesic in 
the Poincaré model D™ of hyperbolic space in Exercise 6.4.22 converges to 
two points on the boundary S™~! = 0D” in forward and backward time, 
and that any two distinct points on the boundary are the asymptotic limits 
of a unique geodesic in D” up to reparametrization. 

Fix an element y € D™” and a tangent vector ¥ € T,yD™ = R™ of norm 
one in the hyperbolic metric, i.e. 


2 
Aly] = 1, A:= ——. 
a [=e 


Let y: R- D™ be the unique geodesic satisfying y(0) = y and ¥(0) = y. 
Prove the following. 


(a) The geodesic y is given by the explicit formula 


h(t)A inh(t) (AY + (Ay, AY 
1 + cosh(t)(A — 1) + sinh(t)(Ay, Ag) 
for t € R. Hint: Use (6.4.12) and the isometries in Exercise 6.4.22. 


(b) The limits 


yt i= lim y(t)e S™1 


too 


exist and are given by 


— Ay + AY + (Ay, AY)Y _ Ay AG — Ay ANDY 
TT N=TE OYA) 7 = 1 = Dy, 9) 


(6.4.18) 


(c) Assume ¥ ¢ Ry. Then there is a unique circle in R™ through y_ and y+ 
that is orthogonal to S™~! at yz. The center c € R™ and the radius r of 
this circle are given by 


on Met (A? — A= (Ay, A9)?)y — (Ay, AD)AG 
a | 0) ) 
pe ren -_ 1 
Le (ue,y-) AP = 2A = Ay, Ay)? 


(d) Let cand r be as in (c). Then the geodesic ¥ in (a) satisfies |y(t) — c| =r 
for all t. Hint: It suffices to verify this equation for t = 0. 

(e) If y € Ry, then y_ + y; = 0 and the geodesic y traverses a segment of 
a straight line through the origin. 

(f) Fix two distinct points y_ and y, on the unit sphere $”~!. Then there 
exists a geodesic 7: R > D™ such that lim; 4+. y(t) = ys. If 7: R- D” 
is any other geodesic satisfying limy.+. y(t) = ys, then there exist real 
numbers a,b such that a > 0 and y/(t) = y(at + b) for all t € R. 
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Exercise 6.4.25. Prove that the isometry group of H™ is the pseudo-ortho- 
gonal group 


Z(H™) = O(m, 1) := {9 € GL(m +1) Q(gu, gw) = Q(v, wv) } . 


for all v,w € R™*! 


Thus, by Corollary 6.4.13, the orthonormal frame bundle O(H"") is diffeo- 
morphic to O(m, 1). 


Exercise 6.4.26. Prove that the exponential map exp, : Tp,H™ — H”™ is 
given by 


exp,(v) = cosh (VQl,»)) o+ ‘a ee) a) (6.4.20) 


for v € T,H™ = p-@. Prove that this map is a diffeomorphism for every 
p € H™. Thus any two points in H™ are connected by a unique geodesic. 
Prove that the intrinsic distance function on hyperbolic space is given by 


d(p,q) = cosh7' (Q(p, q)) (6.4.21) 
for p,q € H’. Compare this with Example 4.3.11. 


Exercise 6.4.27. In the case m = 2 the Poincaré model of hyperbolic space 
in Exercise 6.4.22 is the open unit disc D C C in the complex plane. It can 
be identified with the upper half plane 


H = {z € C|Im(z) > 0} 


via the diffeomorphism ; 

i-z 

itz 

Show that the pullback of the Poincaré metric on D under this diffeomor- 
phism is the Riemannian metric on H given by 


H-D:zvwh 


2] 
y 


for z=ax+iy € Hand Z¢€ T,H=C-. Show that the isometries of H (in the 
identity component) have the form 


az+b a b 
(2) = S4, (: : 


and deduce that the Lie group PSL(2, R) := SL(2,R)/{+1} is isomorphic to 
the identity component of O(2,1). Prove that every nonconstant geodesic 


in H traverses either a vertical half line or a semicircle centered at a point 
on the boundary OH = R. 


|Z|2 = 


) € SL(2,R), 
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6.5 Nonpositive Sectional Curvature 


In the previous section we have seen that any two points in a connected, 
simply connected, complete manifold M of constant negative curvature are 
joined by a unique geodesic (Exercise 6.4.26). Thus the entire manifold 
is geodesically convex and its injectivity radius is infinity. This continues 
to hold in much greater generality for manifolds with nonpositive sectional 
curvature. It is convenient, at this point, to extend the discussion to Rie- 
mannian manifolds in the intrinsic setting. In particular, at some point in 


the proof of the main theorem of this section and in our main example, we 
shall work with a Riemannian metric that does not arise (in any obvious 
way) from an embedding. 


Definition 6.5.1. A Riemannian manifold M is said to have nonpos- 
itive sectional curvature iff K(p,E) <0 for every p€ M and every 2- 
dimensional linear subspace E C T,M or, equivalently, (Rp(u, v)v, u) < 0 for 
allp€ M and all u,v €T,M. A nonempty, connected, simply connected, 
complete Riemannan manifold with nonpositive sectional curvature is called 
a Hadamard manifold. 


6.5.1 The Cartan—Hadamard Theorem 


The next theorem shows that every Hadamard manifold is diffeomorphic to 
Euclidean space and has infinite injectivity radius. This is in sharp contrast 
to positive curvature manifolds as the example M = S™ shows. 


Theorem 6.5.2 (Cartan—Hadamard). Let M be a connected, simply con- 

nected, complete Riemannan manifold. Then the following are equivalent. 

(i) M has nonpositive sectional curvature. 

(ii) The derivative of each exponential map is length increasing, i.e. 
ldexp,(v)8] > [0 

for allpe M and allv,v €T,M. 

(iii) Each exponential map is distance increasing, i.e. 


d(exp,(vo), exp, (v1)) Zz |vo ~ v1| 
for allp € M and all vp,v, € T,M. 


Moreover, if these equivalent conditions are satisfied, then the exponential 
map exp, : T,M — M is a diffeomorphism for everyp € M. Thus any two 
points in M are joined by a unique geodesic. 


Proof. See page 294. 
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Exercise 6.5.3. Let € : [0,00) > R” be a smooth function such that 
€(0)=0, €(0)40, E(t) 40 Vt>O. 

Prove that the function f : [0,00) + R given by f(t) := |€(t)| is smooth. 

Hint: The function 7 : [(0,00) + R” defined by 


ff fe); tore > 0, 
nt) =i é(0), fort =O, 


is smooth. Show that f is differentiable and f = |n|~‘ (n, €). 


Exercise 6.5.4. Let € : R > R” be a smooth function such that 


Prove that there exist constants « > 0 and c > 0 such that, for all t € R, 
lt}<e = IE@P EOP? — EM, EM)? < cle’. 


Hint: Write 

E(t) =tu+n(t),  €(t) =v + H(t) 
with 7(t) = O(t?) and 7(t) = O(t?). Show that the terms of order 2 and 4 
cancel in the Taylor expansion at t = 0. 


Proof of Theorem 6.5.2. We prove that (i) implies (ii). Fix a point p< M 
and two tangent vectors v,v ¢7T,M. Assume without loss of generality 
that v 4 0 and define the curve 7 : R > M and the vector field X € Vect(y) 
along y by 


y(t) := exp, (tv), X(t) := a 


exp,(t(v + sv)) © T,q)M (6.5.1) 
Os s=0 


for t€ R. Then 
X(0) = 0, AL) = dexp,(tv)tv, VA() =v 0. (6.5.2) 


To see this, define the map 8 : R? > M by {(s,t) := exp,(t(v + sv)). It 
satisfies 3(0,t) = y(t), 0,6(0,t) = X(t), B(s,0) = p, and 0:6(s,0) =v + sv 
for all s,t€ R. Hence V-X(0) = V0;8(0,0) = Vs0;6(0,0) = 0. Moreover, 
the curve 6(s,-) is a geodesic for every s, and hence Lemma 6.1.17 asserts 
that X = 0;G(0,-) is a Jacobi field along 4, i.e. 


VVX + R(X,4)¥ = 0. (6.5.3) 
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It follows from Exercise 6.5.3 with €(t) := ®,(0,t) X(t) that the function 
[(0,co) + R: th |X(t)| 


is smooth and 


d a 
| S@l=|IVXO) = el. 
t=0 
Moreover, for t > 0, we have 
ee d (X,VX) 


==. XY) = ee 
del = |X| 
_ [VX +(X,VVX) (KX, VX)? 


|X| Pai (6.5.4) 
_ |XPIVX | = (VX)? (Xx, RG, X)9) 
|x|8 |X| 


> 0. 


Here the third equality follows from the fact that X is a Jacobi field along y, 
and the inequality follows from the nonpositive sectional curvature condition 
in (i) and from the Cauchy—Schwarz inequality. Thus the second derivative 
of the function [0,00) > R: t+ |X(t)| — t]0| is nonnegative; so its first 
derivative is nondecreasing and it vanishes at t = 0; thus 


|X(t)| — t fo] = 0 
for every t > 0. In particular, for t = 1 we obtain 
|dexp,(v)8| = |X(1)| > [a 


as claimed. Thus we have proved that (i) implies (ii). 
We prove that (ii) implies (i). Assume, by contradiction, that (ii) holds 
but there exists a point p € M and a pair of vectors v,v € T,M such that 


(Rp(v,v)v,v) < 0. (6.5.5) 
Define y: R > M and X € Vect(y) by (6.5.1) so that (6.5.2) and (6.5.3) 


are satisfied. Thus X is a Jacobi field with 
X(0) = 0, VX(0) =vF0. 


Hence it follows from Exercise 6.5.4 with €(t) := ®,(0, t)X (t) that there is a 
constant c > 0 such that, for t > 0 sufficiently small, we have the inequality 


IX (PIV X (OP — (X(b), V X(t)? < et®. 
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Moreover, by (6.5.1) and (6.5.2), limy.o 7(¢) = v and limp.o t 1 X(t) = 0. 
Hence, by (6.5.5) there exist constants 6 > 0 and ¢ > 0 such that 


IX()) 25, (XH), RGM, X())7O) s -et”, 


for t > 0 sufficiently small. By (6.5.4) this implies 


Py) _ IXPIVXP - (X,VX)? | (XRG XY) — ot? _ et 
~ XP xT 8 8 


Integrate this inequality over an interval [0,t] with ct? < <6? to obtain 


d d 


dil, |X()| = |VX(0)| 


Integrating this inequality again gives |X (t)| < t|V_X(0)| for small positive t. 
Hence it follows from (6.5.2) that 


|dexp,(tv)t] = [X(é)| < t|V.X(0)| = ¢ [6] 


for t > 0 sufficiently small. This contradicts (ii). 

We prove that (ii) implies that the exponential map exp, : 7,M — M is 
a diffeomorphism for every p € M. By (ii) exp, is a local diffeomorphism, i.e. 
its derivative dexp,(v) : TpM — Texp,(v)M is bijective for every v € 7, M. 
Hence we can define a Riemannian metric on M’ := T,,M by pulling back 
the metric on M under the exponential map. To make this more explicit we 
choose a basis €1,...,€m Of T,M and define the map ~ : R’™ — M by 


m 
w(x) := exp, (> es) 
i=1 
for x = (x!,...,2™) € R™. Define the metric tensor by 


O O 
9ij (2) =(FOs5@), i,g=l,...,m. 


Then (R™,g) is a Riemannian manifold (covered by a single coordinate 
chart) and 7 : (R™,g) — M is a local isometry, by definition of g. The 
manifold (R™,g) is clearly connected and simply connected. Moreover, for 
every € = (€1,...,€") € R™ = ToR”, the curve R — R™ : th tlisa 
geodesic with respect to g (because 7 is a local isometry and the image of 
the curve under w is a geodesic in WM). Hence it follows from Theorem 4.6.5 
that (R™,g) is complete. a 
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Since both (R™,g) and M are connected, simply connected, and com- 
plete, it follows from Corollary 6.1.14 that the local isometry ~ is bijective. 
Thus the exponential map exp, : T,M — M is a diffeomorphism as claimed. 
It follows that any two points in M are joined by a unique geodesic. 

We prove that (ii) implies (iii). Fix a point p € M and two tangent 
vectors vg, v1 € T,M. Let y: [0,1] + M be the geodesic with the endpoints 


(0) =exp,(vo), (1) = exp, (v1) 


and let v : [0,1] + T,M be the unique curve satisfying exp,(v(t)) = y(t) for 
all t. Then v(0) = vo, v(1) = v1, and 


d(expp(vo), exPp(v1)) = L(y) 


it 
Z | |dexp,(v(#))(¢)| at 


Here the third inequality follows from (ii). This shows that (ii) implies (iii). 

We prove that (iii) implies (ii). Fix a point p € M and a tangent 
vector v € T,M and denote q := exp,(v). By (iii) the exponential map 
exp, : TyM — M is injective and, since M is complete, it is bijective (see 
Theorem 4.6.6). Hence there exists a unique geodesic from q to any other 
point in M and therefore, by Theorem 4.4.4, we have 


|w| = d(q, exp,(w)) (6.5.6) 


for every w € TyM. Now define ¢ := exp, | 0 exp, :T,M — T,M. This 
map satisfies ¢(v) = 0. Moreover, it is differentiable in a neighborhood of v 
and, by the chain rule, dé(v) = dexp,(v) : Tp,M — T,M. Now choose 
w := o(v +¥) with v € T,M. Then exp,(w) = exp,(¢(v +¥)) = exp,(v + v) 
and hence it follows from (6.5.6) and part (iii) that 


|o(v + 0))| = |w| = d(q, expg(w)) = d(exp,(v), expp(v + 0)) = lo]. 


This gives 


Ss ~ 1. |\O(ut+te | eee 
|dexp(v0| = |v) = fim OO EEN yy OT jay, 


Thus we have proved that (iii) implies (ii) and this proves Theorem 6.5.2. 
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The next lemma establishes a useful inequality for Hadamard manifolds 
that amplifies the expanding property of the exponential map. 


Lemma 6.5.5. Let M be a Hadamard manifold. Fix an element p © M 
and two tangent vectors v9,v1 € T,M. Then, for0<t<T, 
d(exp,(tvo), exp, (tv1)) Z d(exp,(T'vo), exp,(Tv1)) 

t = T 
Proof. The first inequality in (6.5.7) is part (iii) of Theorem 6.5.2. To prove 
the second inequality, assume vp ~ v1 and define 


(6.5.7) 


luo — v1] < 


yo(t) := exp,(tvo), yi(t) = exp,(tvz) 


for t € R. For each t € R let the curve [0,1] ~ M: s+ 7(s,t) be the unique 
geodesic with the endpoints (0, t) = yo(t) and y(1,t) = y(t). Then 


1 
o(t) := dvo(t), n(@) = / la.y| ds = |8.7(s, | 
for all s and t and hence 


sg) — [1 (Os. Vs7) 9, 
ao = | (er d es 
= (Osy(1, t), HCL #)) _ (Os7(0, t), #7(0, €)) 


p(t) 
d 


Since 4(pp) = pp + 6” and 7 and 4 are geodesics, this implies 


pliyatt) + lt)? = £((Axn(1,1),2(1,1)) — (8670, t), e410, t))) 
= (W0s7(1, t), (1, t)) — (V0s7(0, t), vy(0, t)) 


a6) 
= [5 (6i801.817) ds 


1 
= | (INianr? + (VeViO57, 917) ds 
0 


1 
= f (Imaal? + (R@s7, A727, 847)) ds 
0 
> p(t)’. 


Here the last step follows from (6.5.8), the Cauchy—Schwarz inequality, and 
the nonpositive sectional curvature assumption. Thus p: [0,7] > R isa 
convex function satisfying p(0) = 0 and hence p(t) < te(T)/T for0 <t < T. 


This proves Lemma 6.5.5. 
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6.5.2 Cartan’s Fixed Point Theorem 


Recall from Definition 6.5.1 that a Hadamard manifold is a nonempty, con- 
nected, simply connected, complete Riemannian manifold of nonpositive 
sectional curvature. 


Theorem 6.5.6 (Cartan). Let M be a Hadamard manifold and let G be a 
compact topological group that acts on M by isometries. Then there exists 
a point p € M such that gp = p for every g € G. 


Proof. See page 300. 


The proof follows the argument given by Bill Casselmann in [16] and 
requires the following two lemmas. The first lemma asserts that every com- 
plete, connected, simply connected Riemannian manifold of nonpositive sec- 
tional curvature is a semi-hyperbolic space in the sense of Alexandrov [3]. 


Lemma 6.5.7 (Alexandrov). Let M be a Hadamard manifold, let m © M 
andv € T,,M, and define 


Po = exp, (a) P= €XP (UV). 


Then 3 
d(po, P1) 


< d(po, 4)? + d(p1,.4)° 
for everyqeE M. 


Proof. By Theorem 6.5.2 the exponential map exp,, : TImM—- M isa 
diffeomorphism. Hence d(po,pi) = 2\v|. Now let gq € M. Then there is a 
unique tangent vector w € T;,M such that 


T=EXPm(w), — d(m,q) = |w]. 


Since the exponential map is expanding, by Theorem 6.5.2, we have 


d(po, q) = Jw + v|, d(p1, q) = Jw ~ v|. 
Hence 
d(m,qy = |w? 
Jw + o|? + |w — o/? 2 
; | 
- A€po,9)" +4r1,9)? _ Apo, P1)" 
= 2 4 : 


This proves Lemma 6.5.7. 
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The next lemma is Serre’s Uniqueness Theorem for the circumcentre 
of a bounded set in a semi-hyperbolic space. 


Lemma 6.5.8 (Serre). Let M be a Hadamard manifold and, for p © M 
and r > 0, denote by B(p,r) C M the closed ball of radius r centered at p. 
Let XC M be a nonempty bounded set and define 


rq := inf {r > 0| there exists ap € M such that QC B(p,r)}. 
Then there exists a unique point po € M such that 2 C B(pg,ra). 
Proof. We prove existence. Choose sequences r; > rg and p; € M such that 
Qc B(pi,ri), Tir 7% = 7m. 
Ico 
Choose gq € 2. Then d(q,p;) < r; for every 7. Since the sequence r; is 
bounded and M is complete, it follows that p; has a convergent subsequence, 


still denoted by p;. Its limit pg := lim;_,.. p; satisfies Q C B(pa, ra). 
We prove uniqueness. Let po,p, € M such that 


QC B(po, ra) a B(p1, 7a). 


Since the exponential map exp, : T,M — M is a diffeomorphism (by Theo- 
rem 6.5.2), there exists a unique vector vp € Tp,M such that p; = exp,, (vo). 
Denote the midpoint between po and p; by 


M := €XPp, (v0) ; 
Then it follows from Lemma 6.5.7 that 


d 24d 2 d(po,p1)” 

dona? = (po, q)" + d(p1, 9) (po, P1) 
2 4 
2 
2 gn d(po, 1) 
4 

for every q € Q. Since supzeq d(m, g) = Ta (by definition of rq), it follows 
that d(po,pi) = 0 and hence po = pi. This proves Lemma 6.5.8. 


Proof of Theorem 6.5.6. Let q © M and consider the group orbit 
Q:= {gq|9 € G}. 


Since G is compact, this set is bounded. Let rg and pg be as in Lemma 6.5.8. 
Then 2 Cc B(pg,ra). Since G acts on M by isometries, this implies 


Q = gQ Cc B(gpa,ra) 


for all g € G. Hence it follows from the uniqueness statement in Lemma 6.5.8 
that gpg = pa for every g € G. This proves and Theorem 6.5.6. 
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6.5.3 Positive Definite Symmetric Matrices 


We close this section with an example of a nonpositive sectional curvature 
manifold which plays a key role in Donaldson’s approach to Lie algebra 
theory [17] (see §7.6). Let m be a positive integer and consider the space 


P = P(R"):= {P e¢R™m | PSPs 0} (6.5.9) 


of positive definite symmetric m x m-matrices. (Here the notation “P > 0” 
means (xz, Px) > 0 for every nonzero vector x € R™.) Thus # is an open 
subset of the vector space Y := {S € R™*™|S' = $} of symmetric ma- 
trices and hence the tangent space of Y is TpY = Y for every P € &. 
However, we do not use the metric inherited from the inclusion into .Y but 
define a Riemannian metric by 


(P,, Py) p — trace(P,P~1P,P~') (6.5.10) 
for P€ Aand Pi, P» EC TpYA=.-F. 
Theorem 6.5.9. The space Y with the Riemannian metric (6.5.10) is a 


connected, simply connected, complete Riemannian manifold with nonposi- 
tive sectional curvature, and the distance function on F is given by 


uP) = 1[trace( (log(P-1/2QP-1/2))*) (6.5.11) 


for P,Q € F. Moreover, F is a symmetric space and the group GL(m, R) 
of nonsingular m x m-matrices acts on Y by isometries via P 4 gPg! 
for g © GL(m,R). 


Proof. See page 305. 


Remark 6.5.10. Let V be an m-dimensional vector space and # C S?V* 
be the set of inner products on V. Define a Riemannian metric on # by 


(hi, ho)h = trace(S1S2), h(-, Si") = fies (6.5.12) 


for he # and Pag ho € 1,4 = S?V*. Then every vector space isomor- 
phism a : R™ —> V determines a diffeomorphism ¢,g :# > FY via 


gao(h)=P <=> h(a, an) = (, Pon) Rm. (6.5.13) 
The derivative of dg at A in the direction wel nv is given by 
dba(h)h =P = PP'=-a7'Sa, h(-,S-):=h. (6.5.14) 


Thus ¢q is an isometry with respect to the Riemannian metrics (6.5.12) 
on # and (6.5.10) on Y. The ¢, form an atlas on # with the transition 
maps ¢ga(P) := ¢g°0 ¢,1(P) = 980P Oper where gga := Bla € GL(m,R). 
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Remark 6.5.11. The submanifold 
Yo = A(R") =H {Pe 4 | dal?) = 1} (6.5.15) 


of positive definite symmetric m x m-matrices with determinant one is to- 
tally geodesic (see Remark 6.5.12 below). Hence all the assertions of Theo- 
rem 6.5.9 (with GL(m,R) replaced by SL(m, R)) remain valid for Wp. 


Remark 6.5.12. Let M be a Riemannian manifold and L Cc M be a sub- 
manifold. Then the following are equivalent. 


(i) If y: I > M is a geodesic on an open interval J such that 0 € J and 
WOVE L, 70) € Ty L, 


then there is a constant ¢ > 0 such that y(t) € L for |t| < e. 


(ii) If y : J > L is a smooth curve on an open interval J and ®, denotes 
parallel transport along y in M, then 


®,(t, 8)Ty(5)L = TyayL Vs,tel. 


(iii) If y : J > L is a smooth curve on an open interval J and X € Vect(y) 
is a vector field along y (with values in TM), then 


X(t) e Typ L Vtel => VX(t) € TyyL Vtel. 


A submanifold that satisfies these equivalent conditions is called totally 
geodesic 


Exercise 6.5.13. Prove the equivalence of (i), (ii), (iii) in Remark 6.5.12. 
Hint: Choose suitable coordinates and translate each of the three assertions 
into conditions on the Christoffel symbols. 


Exercise 6.5.14. Prove that Wo is a totally geodesic submanifold of Y. 
Prove that Y is diffeomorphic to the quotient GL(m, R)/O(m) via polar de- 
composition and that Yo is diffeomorphic to the quotient SL(m, R)/SO(m). 
Hint: Consider the map GL(m,R) > A:gH Vgg'. 


Exercise 6.5.15. In the case m = 2 prove that Yo is isometric to the 
hyperbolic space H?. 


The proof of Theorem 6.5.9 is based on the calculation of the Levi-Civita 
connection and the formulas for geodesics and the Riemann curvature tensor 
in the following three lemmas. 
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Lemma 6.5.16. Let 1 > A: t+ P(t) be a smooth path in FY on an 
interval IC R and let I > A: t+ S(t) be a vector field along P. Then 
the covariant derivative of S' is given by 


VS=6= ae = oe (6.5.16) 


Proof. The formula (6.5.16) determines a family of linear operators on the 
spaces of vector fields along paths that satisfy the torsion-free condition 


Vs0,P = V,0;P 
for every smooth map R? + # : (s,t) ++ P(s,t) and the Leibniz rule 
V (51,92) p = (VS1, S2) p + ($1, VS2) p 
for any two vector fields S; and Sg along P. These two conditions determine 


the covariant derivative uniquely (see Lemma 3.6.5 and Theorem 3.7.8). 
This proves Lemma 6.5.16. 


Lemma 6.5.17. The geodesics in FY are given by 


ai) = Pexp(tP~'P) 
= exp(tPP~')P (6.5.17) 
= PY exp(tP-V/? Pp- V2) pl/? 
for PE, PeTpP= SY, andtER. In particular, Y is complete. 


Proof. The curve y: R + F defined by (6.5.17) satisfies 


at) = P exp(tP~'P) = PP-'y(t). 
Hence it follows from Lemma 6.5.16 that 


V(t) = 4(6) — (1) 14) 
= 4(t) -— PP“'¥(t) 
= 0 


for every t € R. Hence ¥ is a geodesic. Since the curve y: R > # in (6.5.17) 
satisfies y(0) = P and 4(0) = P, this proves Lemma 6.5.17. 
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Lemma 6.5.18. For Pe Y, S,T,A€ S the curvature tensor on F is 
1 i 
Rp(S,T)A= —zSPUTP'A = po 
i 1 
- qiP USP A + qAP USP OT (6.5.18) 
l 1 -1 -1 
aie [|sP PP), AP™| P. 


Proof. Choose smooth maps P : R? > Y and A: R? > .Y and define 
S :=0sP and T := OP. Then 0;T = 0):S and Rp(S,T)A = VsYWA-VVA. 
Moreover, by Lemma 6.5.16 we have V,A = 0,A — 5AP 18 _ 59P1A 
and YA = 0,A — 5AP TT — 50 P-1A. Hence 


1 1 
Rp(S,T)A=0,YA— (ViA)P'S - ySP (MA) 
1 1 
— O,V,A + 5 (WeA)P OT 5TP'(VeA) 
= 0,(OA -54P"'T - 5TP A) 
1 t gpet Tease -1 
_ 5 (&A - 5AP 7 — =TP A)P Ss 
1 —1 1 —1 1 —1 
=5P (aA =AP"'T - TP A) 
— a,(8,4—- 1 ap-ig_ 5SP1A) 
2 2 
1 Teel Le —1 
+5 (0.4 =AP-1S — =SP A)P T 
TP (0.4 — 5AP-18 —5SP A). 
A term by term inspection shows that the partial derivatives of A, S, T 
cancel because 0,1’ = O,S. Hence we are left with the drivatives of P, so 
Rp(S,T)A 
1 1 
= 5AP'(OsP)P'T + 5TP'(0sP)P A 
+ Ap Tes + {FP Ap ts + aa aa + ee ae 


_ SAP (.P)P~1 _ 59P(P)P1A 


1 1 1 i 
= qAP USP UT = qSP CAP UT = qiPp AP'S = qrP USP UA. 


Insert 0,P = S, 0:P = T to obtain (6.5.18). This proves Lemma 6.5.18. 
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Proof of Theorem 6.5.9. The manifold Y is obviously connected and simply 
connected as it is a convex open subset of a finite-dimensional vector space. 
That the map GL(m,R) x A> FY: (g,P)> gPg' defines a group action 
of GL(m,R) on 7 By isometries follows directly from the definitions. The 
remaining assertions will be proved in three steps. 


Step 1. The manifold PY has nonpositive sectional curvature. 
By Lemma 6.5.18 with A = T and equation (6.5.10) we have 
(S, Rp(S,T)T) p 
= trace (SP~'(Rp(S, DP) 


= —Ttrace (SPS 2P | ee) 


1 1 

= jtrace (SP a BY rc ca a — gtrace (SP oy cama cao aad) 
1 1 

— aitrace (X?) — gtrace (x7x) ; 


where X := P71/29P-!7P-1/2 € R™X™. Write X =: (247 )i7=1,...,m Then, 
by the Cauchy—Schwarz inequality, we have 
trace(X?) = = tex <= dott = = trace(X'X). 
1,9 a,j 
Thus (S, Rp(S,T)T) p < 0 for all P€ and S,T € Y. This proves Step 1. 
Step 2. FY is a symmetric space. 


Fix an element A € FY and define the map ¢: Y — FY by ¢(P):= AP“!A 
for P € #. This map is a diffeomorphism, fixes the matrix A = (A), and 
its derivative at P € F is given by d¢(P \P= a = 1PP-1A for P € Tp. 
Hence d¢(A) = —id. Moreover, (d¢(P)P)¢(P)~! = —AP7 1PA-! and so 


|db(P)Placpy = trace((AP-1P.4™')”) = wace( (P™1P)) = |P\2, 
for all P € Pand P € TpY. Thus @ is an isometry and this proves Step 2. 


Step 3. The distance function on F is given by (6.5.11). 


Let P,Qe Y. Then, since # is a Hadamard manifold by Step_1 and 
Lemma 6.5.17, there exists a unique matrix P € Y such that exp P (P) =), 
By Lemma 6.5.17 this equation reads P!/? exp(P~!/2PP-1/2)Pl/2 = Q. 
Thus S$ := P-'/?Pp-'/? — log(P-/?QP-V/?) and 

diPoyr= |P|2 = trace(PP-'PP~*) = trace(S”). 
This proves Step 3 and Theorem 6.5.9. 
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Remark 6.5.19. Theorem 6.5.9 carries over to the complex setting as fol- 
lows. Replace Y by the space 


Q={(Ver" | =]Os0} (6.5.19) 


of positive definite Hermitian matrices. Here Q* denotes the conjugate trans- 
posed matrix of Q € C™*™ and the notation “Q > 0” means z*Qz > 0 for 
every nonzero vector z € C™. Thus 2 is an open subset of the vector space 
of Hermitian m x m-matrices. Define the Riemannian metric on Z by 


(A, H2)g = Re(trace(H1Q~'H2Q™")) 


for Q € 2 and Mi, Hy € Tg. Then all the assertions of Theorem 6.5.9 
(with GL(m, R) replaced by GL(m, C)) carry over to 2. The proof is verba- 
tim the same, with the transposed matric replaced by the conjugate tram- 
sposed matrix and the trace replaced by the real part of the trace. 


Remark 6.5.20. The set 2p := {Q E Z| det(Q) = 1} of positive definite 
Hermitian matrices with determinant one is a totally geodesic submanifold 
of 2. Hence all the assertions of Theorem 6.5.9 (with GL(m,R) replaced 
by SL(m,C)) remain valid for Qo. as 


Exercise 6.5.21. Show that Theorem 6.5.9 remains valid for Z and that Qo 
is a totally geodesic submanifold of 2. Prove that Q is diffeomorphic to 
the quotient GL(m,C)/U(m) and 2p is diffeomorphic to SL(m, C)/SU(m). 
Hint: Consider the map SL(m,C) > 29:9 \/gg*. Show that the pull- 
back metric on SL(m, C)/SU(m) is given by the norm of the Hermitian part 
of the matrix g~!g for g € T,SL(m, C). 


Exercise 6.5.22. In the case m = 2 prove that Qo is isometric to the 
hyperbolic space H?. 


The space Yp = SL(m, C)/SU(m) (with nonpositive sectional curvature) 
can be viewed as a kind of dual space of the Lie group SU(m) (with non- 
negative sectional curvature). They have the same dimension and in both 
cases the Riemann curvature tensor is given by the Lie bracket (see equa- 
tion (5.2.22) for SU(m) and equation (6.5.18) for Yo). One can think of 
the noncompact Lie group G° := SL(m,C) as the complezification of the 
compact Lie group G := SU(m). It can be written in the form 


G° = {exp(in)u| u € G,7n € g}, (6.5.20) 


the Lie algebra of G° is the complexification g° = g @ ig of the Lie algebra 
of G, and the quotient G°/G is a Hadamard manifold. These observations 
carry over to all Lie subgroups G C SU(m). For an exposition see [20]. 
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Exercise 6.5.23 (Siegel upper half space). (i) The standard symplectic 
form wo on R?2” = R” x R” is given by 


0 -1 

—_ T = 

wl (ee = (9G): 

for z,¢ € R?” and the space of wo-compatible linear complex structures is 


the (n? + n)-dimensional manifold 


J? =-1, JJn + JnJ' =0, 


2n — 2nx2n 
T(R”, wo) = {JER wo(C, JC) > 0 for 0 CR? } (6.5.21) 


Define a Riemannian metric on J(M,wo) by 
(Ky, Jy) — trace(J; J2) (6.5.22) 


for Jiaede € T)7(R2",wo). Show that the symplectic linear group Sp(2n) 
(Exercise 2.5.5) acts on the space 7(R?",wo) by isometries J +> gJg7!. 
If J € Jo(R2”", wo) and P:= —JJo = P', show that wo(-,J-) = (-, P71). 
Show that the map J(R2",wo) > Po(R2") : JH —JJo is an Sp(2n)- 
equivariant isometric embedding, whose image is a totally geodesic subman- 
ifold of Ap(R?"). Deduce that 7(R?",wo) is a Hadamard manifold and a 
symmetric space. For every J € J(R?”,wo) show that the map J’ > —JJ'J 
is an isometry fixing J whose derivative at J is —id. 


(ii) Siegel upper half space is the manifold S, c C"*” of symmetric 
complex n x n-matrices with positive definite imaginary part [71]. The sym- 
plectic linear group Sp(2n) acts on this space via 


A B 
.Z:=(AZ+B)(CZ+D) 1, =( ). 
g ( )( ) g (6.5.23) 
Alco =C'A, B'D=D'B, Ah pC" R= 1, 
for g € Sp(2n) and Z € S,. Show that this is a well-defined group action. 
(For hints see [49, page 72].) Show that there is a unique Sp(2n)-equivariant 


diffeomorphism from S,, to 7(R?",wo) that sends ill to Jo. Show that this 
map is given by the explicit formula 


XY-! -y-xy1Xx , 
Ja) = ( yo ayoly ) E 7 (R”, wo), Z=X+iY € Sp. 
Show that the diffeomorphism S, > J(R?2",wo) : Z ++ J(Z) is an isometry 
with respect to the Riemannian metric on S,, given by 


Zip = 2trace((Y-1X)? + (yyy) (6.5.24) 
for Z =X +iY €S, and Z=X +i¥ € TS. 
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6.6 Positive Ricci Curvature* 


In this section we prove that every complete connected manifold M Cc R” 
whose Ricci curvature satisfies a uniform positive lower bound is necessarily 
compact. If the sectional curvature is constant and positive, this follows 
from Corollary 6.4.12 as was noted in Example 6.4.16. 


Definition 6.6.1 (Ricci tensor). Let M Cc R” be an m-dimensional sub- 
manifold and fix an element pe M. The Ricci tensor of M at p is the 
symmetric bilinear form 


Ricp : TyM x Ty>M > R 


defined by 
m 
Ricp(u, v) = )_(Rp(ei,u)v, e), (6.6.1) 
i=l 
where €1,...,€m 18 an orthonormal basis of T,M. The Ricci tensor is inde- 


pendent of the choice of this orthonormal frame and is symmetric by equa- 
tions (5.2.16) and (5.2.18) in Theorem 5.2.18. 


The Ricci Tensor in Local Coordinates 


Let 6: U — 1 bea local coordinate chart on an open set U C M with values 
in an open set 2 C R™, denote its inverse by = := @7! : Q > U, and let 


O 
Lie) = ve Lr) € Tyx)M rEeQ, 4. = Dy eceg Tbs 


be the local frame of the tangent bundle determined by this coordinate chart. 
Denote the coefficients of the first fundamental form by 4 = Ey £7) and 


the coefficients of the Riemann curvature tensor by Rt jhe? :Q—- R so that 
R(Ej, Bj) Ex = )~ Rij, Be. 
é 
(see Section 5.4). Then 
Ries = Rie( Ay, £4) = 3 Ry = 3 Rogan: (6.6.2) 


pv=l1 


(Exercise: Prove this.) 
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The Bonnet—Myers Theorem 


Theorem 6.6.2 (Bonnet—Myers). Let M C R" be a complete, connected 
manifold of dimension m > 2 and suppose that there exists a 6 > 0 such that 


Ric,(v, v) > (m—1)6d |u|” (6.6.3) 


for every p€ M and everyv € TyM. Then d(p,q) < 7/V6 for all p,q € M 
and hence M is compact. 


Proof. See page 310. 


Lemma 6.6.3. Let R x [0,1] > M : (s,t) +> ¥5(t) be a smooth map such 
that y := yo : [0,1] > M is a geodesic and y,(0) = (0) and y,(1) = y(1) 
for all s ER. Define the vector field X along y by X(t) := £| s(t) 
forO<t<1. Then 

a 


1 
Fa) Blow) =— f (VX + REA), x) ae 


a (6.6.4) 


1 
= f (IvxP - (x, 5)4,%)) at 
0 
Proof. In the proof of Theorem 4.1.4 we have seen that 


d 


L 
E18) = — i (Vide7s(t), Osrv(t)) dt 


for all s € R (see equation (4.1.10)). Differentiate this equation again with 
respect to s and use the identity V,0; = V0, to obtain 


d? d 
ay Lae Te 8), Us'Ys £ 
gb) =~ a5] fl (Starrs. ds) d 


1 1 
== | (Vs Vi0rYs Oss) dt — | (Vis; Vs0sYs) dt 
0 0 


I 
= | NA RO OP. Ort 
0 


1 
-{ (Vi017s, VsOsYs) dt. 
0 


Now take s = 0. Then V,0;7 = 0 because 7¥ is a geodesic and hence 
d2 
ds? | soy 

This proves the first equality in (6.6.4). To prove the second equality 


in (6.6.4) use integration by parts and the fact that X (0) = 0 and X(1) = 0. 
This proves Lemma 6.6.3. 


i 
E(qs) = -| (ViVLX + R(X,4)4, X) dt 
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Proof of Theorem 6.6.2. Let p,q € M. By the Hopf-Rinow Theorem 4.6.6 
there exists a geodesic y : [0,1] > M such that 


VO=~p, yA=q L(y) =dp,¢q). 


Let X € Vect(7) be a vector field along y such that X(0) = 0 and X(1) =0 
and define s(t) := exp,q)(sX(t)) for se R and 0<t<1. Then s =0 is 
the absolute minimum of the function R > R:s++ E(ys) by Lemma 4.4.1. 
Hence © \s-0E (7s) > 0 and by Lemma 6.6.3 this implies 


1 1 
| (R(X, 4)y,X) dt < / IVE X(t)/? dt. (6.6.5) 
0 0 


Now assume pq and choose an orthonormal frame EF),..., Ey, along y 
such that Ey = 7/|7| and VE; = 0 fori = 1,...m. Define 


X;(t) := sin(at) E;(t) 
fori=1,...mand0<t<1. Then |V,X;(t)| = a cos(zt) for all i and t and 


5(m — 1) |4(t)? < Ricyw(4(d), 4) = 50 (RIE), 110) 70), Fx) 
i=2 
for 0<t<41. Multiply this inequality by sin?(t), integrate over the unit 
interval, and use the identities |7)(t)| = d(p,q) and i sin?(mt) dt = 1/2 to 
obtain the estimate 
d(m — 1) 


1 
“wn a(p,9)? = / 5(m — 1) sin?(mt) [(t)/2 at 
mpl 


< | |VX;(t)|? dt 


1 
(m — 1) | x? cos” (rt) dt 
0 
_ m(m—1) 
- 2 
Here the third step uses (6.6.5). Since m > 2 it follows from this estimate 
that d(p,q)? < 72/6 and this proves Theorem 6.6.2. 


A direct consequence of Theorem 6.6.2 is that every compact manifold 
with positive Ricci curvature has a compact universal cover and hence has 
a finite fundamental group. 
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Positive Sectional Curvature 


Corollary 6.6.4. Let M C R” be a complete, connected manifold of dimen- 
sion m > 2 and suppose that there exists a 6 > 0 such that 


K(p, E) 26 


for every p € M and every 2-dimensional linear subspace E C T,M. Then 


us 
d(p,q) < = 
(Pp; 4) Ti 
for all p,q € M and hence M is compact. 
Proof. The condition Kk > 6 implies (6.6.3) with m := dim(M) and hence 


the assertion follows from Theorem 6.6.2. This proves Corollary 6.6.4. 


The example of the m-sphere shows that the estimate in Corollary 6.6.4 
is sharp. Namely, M := S$” has sectional curvature K = 1 and diameter 7. 
The paraboloid M := (2, y,z) € R?|z = 2?+4+ y’} has positive Gaufian 
curvature and so positive Ricci curvature (Lemma 6.7.2) but is noncompact. 


Remark 6.6.5 (Sphere Theorem). The Topological Sphere Theorem 
asserts that every complete, connected, simply connected Riemannian m- 
manifold M whose sectional curvature satisfies the estimate 


1/4< K(p,B) <1 


for every p € M and every 2-dimensional linear subspace E' C T,M must be 
homeomorphic to the m-sphere. The Differentiable Sphere Theorem 
asserts under the same assumptions that M is diffeomorphic to S$”. 

The problem goes back to a question posed by Heinz Hopf [31, 32] in the 
1920s. After intermediate results by Rauch [57] (with 1/4 replaced by 3/4) 
and others, the Toplogical Sphere Theorem was proved in 1961 by Berger [6] 
and Klingenberg [41]. The Differentiable Sphere Theorem was proved in 
2007 by Brendle and Schoen [8, 9, 10]. They even weakened the assumption 
to 0 < maxg K(p, E) < 4ming K(p, F) for all p € M, where the maximum 
and minimum are taken over all 2-dimensional linear subspaces EC T,M. 

The Topological Sphere Theorem is sharp, as the suitably scaled Fubini— 
Study metric on complex projective space satisfies 1/4 < K(p, E) <1 for 
all p and EF. The Differentiable Sphere Theorem is a significant improve- 
ment, because in many dimensions there exist smooth m-manifolds that are 
homeomorphic, but not diffeomorphic, to S$. These are the so-called ex- 
otic spheres and many of those do not even admit metrics of positive scalar 
curvature [30]. (For the definition of scalar curvature see §6.7.) 
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6.7 Scalar Curvature* 


This section introduces the scalar curvature and explains how it is related to 
the Ricci tensor and the Riemann curvature tensor in dimensions 2 and 3. 
The section also includes a brief discussion of several problems in differential 
geometry in which the scalar curvature plays a central role. 


Definition and Basic Properties 


Let m be a positive integer and let M C R” be an m-manifold. For each 
element p € M denote by Rp: T,M x T,M — End(T,M) the Riemann cur- 
vature tensor and by Ric, : 7,M x T,M — R the Ricci tensor of M at p. 


Definition 6.7.1 (Scalar curvature). Fir an element p € M. The scalar 
curvature S(p) of M at p is the trace of the Ricci tensor and is given by 


m m 
S(p) = } | Riep(ei, es) = ) | (Rp(ei, es)e7, ei), (6.7.1) 
i=1 ij=l 
where €1,...,€m 1s an orthonormal basis of T,M. The scalar curvature is 


independent of the choice of this orthonormal frame. 


Lemma 6.7.2. Assume m = 2 and let K : M > R be the Gaufsian curvature 
of M in (6.4.2). Then 


S(p) = 2K(p), Ricp(u, v) = K(p)(u, v), (6.7.2) 


(Rp(u,v)w, 2) = K(p)((u, 2)(v,w) = (u,w)(v,z)) (6.7.3) 
for allpe M and u,v,w,z€T,M. 


Proof. By definition, the Gaufian curvature is given by 


Kp = Rollo) 


= ufo? — (a0) eee 


2 


for every pair of linearly independent tangent vectors u,v € T,M. Take u 
to be a unit vector orthogonal to v to obtain Ric,(v,v) = K(p)|v|? for 
every v € TM. Since Ric, : T,M x T,M — R is a symmetric bilinear form, 
this implies the second equality in (6.7.2). With this understood the first 
equality in (6.7.2) follows directly from the definition of the scalar curvature 
in (6.7.1). Equation (6.7.3) follows from (6.7.4) by Theorem 6.4.8. This 
proves Lemma 6.7.2. — = a 


6.7. SCALAR CURVATURE* 313 


Lemma 6.7.3. Assume m= 3. Then 
(Rp(u, v)w, z) = Ricp(v, w)(u, z) — Ricp(u, w)(v, z) 


+ Ricp(u, z)(v, w) — Ricp(v, z)(u, w) (6.7.5) 
— 2D) ((u, 2)(v, w) — (u, w)(v,2)) 


2 
for allp€ M and all u,v,w,z€ T,M. 


Proof. Choose an orthonormal basis €1, e2, e3 of the tangent space T,M and 
define the endomorphism 


Qp : TyM + TyM 
by 
oT = Se €4 ej 


for u€ T,M. The right hand side of this equation is independent of the 
choice of the orthonormal basis and the endomorphism Q,, satisfies 


trace(Qy) = S(p) 


and 
(Qpu, v) = Ricp(u, v) 
for all u,v € T,M. With this notation equation (6.7.5) takes the form 


R,(u,v)w = Ricp(v, w)u + (v, w) Cz = “P).,) an 
— Ric,(u, w)u — (u, w) (2,0 = oP) 


It suffices to verify equation (6.7.6) in the following three cases. 
(a) u,v are linarly dependent. 

(b) u,v, w are orthonormal. 

(c) u,v are orthonormal and w = v. 


In the case (a) both sides of equation (6.7.6) vanish. In the case (b) we have 


Rp(u, v)w = (R,(u, v)w, uu + (Rp(u, v)w, v)v 


= Ricp(v, w)u — Ricp(u, w)v, 


and this is equivalent to (6.7.6). 
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In the case (c) equaton (6.7.6) takes the form 


Rp(u, v)u = Ricp(v, v)u — Ricp(u, v)u + Qpu — Sy, (6.7.7) 


To verify this formula, choose a unit vector w that is orthogonal to u and v. 
Then it follows from the definition of S(p) and Q, that 


ae = (Rp(u, v)v, u) + (Rp(w, v)v, w) + (Rp(u, w)w, u) 


= Ricp(v, v) + (Rp(u, w)w, u), 
and 
Qpu = R,(u,v)v + Rp(u, w)w 
= R,(u,v)u + (Rp(u, w)w, v)v + (Rp(u, w)w, u)u 


S 
= R,(u, v)u + Ricp(u, v)v + S0), — Ricp(v, v)u. 


This proves (6.7.7) and Lemma 6.7.3. 


Scalar Curvature in Local Coordinates 


Let d6: U > 21 bea local coordinate chart on an open set U C M with values 
in an open set 2. Cc R™, denote its inverse by 


~:=o¢':050, 
and denote by 
for x € Qandi =1,...,m the local frame of the tangent bundle determined 
by this coordinate chart. Denote the coefficients of the first fundamental 
form by gij := (£;, Ej), of the Ricci tensor by 
Ric; := Ric(£;, £;), 


and of the Riemann curvature tensor by Rj;~¢ and Rij , 80 that 


m 
Rijre = (R(E;, Ej) Ex, Ee), R(Ej, Ej) Ex == So Rijn Be. 
é=1 
Then the scalar curvature is the function S :Q— R given by 
S= > Ric 9" = Ss" Rigo” = Ss" Rijxeg 9. (6.7.8) 
ij=l ijval i,j,k l=1 


(Exercise: Prove this.) 
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Positive Scalar Curvature 


An important question for a compact smooth manifold M is whether or not 
it admits a Riemannian metric of positive scalar curvature. A theorem of 
Lichnerowicz [46] asserts that, if 17 is a compact spin manifold of dimen- 
sion m = 4n that admits a Riemannian metric of positive scalar curvature, 
then a certain characteristic class of this manifold (the A-genus) must van- 
ish. The definitions of the terms that appear in this sentence (spin structure 
and A-genus) as well as in the proof, which involves the Dirac operator, the 
Atiyah—Singer index theorem, and the Weitzenbéck formula, go beyond the 
scope of the present book. For an exposition see [65, Theorem 6.30]. 

A nonlinear variant of Lichnerowicz’ theorem asserts that a compact 
oriented smooth 4-manifold with bs — b; odd and ie > 1 that admits a 
Riemannian metric of positive scalar curvature has vanishing Seiberg-Witten 
invariants (see [65, Proposition 7.32]). 

In another direction Gromov and Lawson [22] proved that if MM, and M2 
are two compact manifolds of dimension m > 3 that admit Riemannian met- 
rics of positive scalar curvature, then so does their connected sum M,#M) 
(see also [65, Theorem 2.18]). 

In the late 1970’s Schoen and Yau [70] proved, using minimal surfaces, 
that the torus T’ = R™/Z™ does not admit a metric of positive scalar 
curvature for m < 7. We remark that the A-genus of the torus vanishes and 
so Lichnerowicz’ theorem does not apply. In [22] Gromov and Lawson refined 
the techniques of Lichnerowicz to prove that, for any m, the m-torus does not 
admit a metric of positive scalar curvature. In fact, they proved that for any 
compact spin manifold M of dimension m the connected sum N := M#T™ 
does not admit a metric of positive scalar curvature. Moreover, they proved 
that if N admits a metric of nonnegative scalar curvature, then this metric 
must be flat and N must be the standard m-torus. 


Constant Scalar Curvature 


An interesting class of Riemannian metrics consists of those that have con- 
stant scalar curvature. In dimension m = 2 the existence of such a metric 
is the content of the uniformisation theorem. The proof involves the 
solution of the Kazdan-Warner equation and goes beyond the scope of this 
book. For an exposition see [65, Theorem 2.20 and Theorem D.1]. 

In dimension two Lemma 6.7.2 shows that the constant scalar curva- 
ture condition is equivalent to constant sectional curvature. However, in 
higher dimensions the constant scalar curvature condition is more general. 
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By Corollary 6.4.12 every compact simply connected m-manifold with con- 
stant sectional curvature is diffeomorphic to the m-sphere, while constant 
scalar curvature metrics exist on every compact manifold. Examples are 
the Fubini-Study metric on complex projective space (Example 2.8.5 and 
Example 3.7.5), locally symmetric spaces (Theorem 6.3.4), and products of 
Riemannian manifolds with constant scalar curvature. 


Definition 6.7.4. Let M be a Riemannian manifold with the metric g. A 
Riemannian metric g’ on M is called conformally equivalent to g iff there 
exists a smooth function X: M — (0,00) such that g! = Ag. The set of all 
such Riemannian metrics is called the conformal class of g. 


Remark 6.7.5 (Yamabe problem). The Yamabe problem asserts that 
the conformal class of every Riemannian metric on a compact m-manifold M 
of dimension m > 2 contains a metric of positive scalar curvature. 

This problem was formulated in 1960 by Hidehiko Yamabe and was even- 
tually settled in the affirmative in 1984 by the combined work of Hide- 
hiko Yamabe [78], Thierry Aubin [5], Neil Trudinger [75], and Richard 
Schoen [69]. The proof for a compact manifold M of dimension m > 2 
relies on finding a positive function f : M— R and a real number c that 
satisfy the Yamabe equation 


A(m — 1) 
—2 
Here S, : M — R denotes the scalar curvature of the Riemannian metric g 


and A, denotes its Laplace—Beltrami operator. In local coordinates this 
operator is given by the formula 


Ais. foe. (6.7.9) 


7 a 
A= 79" Vdet(9) a5 - (6.7.10) 


1 a 
Vdet(g) X Ox 


If f : M > (0,co) is a positive solution of (6.7.9), then the Riemannian 
metric f4/(™-2)g has the scalar curvature c. Exercise: Prove this. Hint: 
Show first that 


Ss 


g =U 7S, + 2(m — 1)uFAgu — (m — 1)(m— 4)u~*|dul?. (6.7.11) 
Then take f := u’/2~! and use the identities 

|du”|2 = meu | dale, (6.7.12) 
Agu” = nu" Agutn(n— Lu"? |dul? (6.7.13) 


for a smooth function u: M — (0,00) and a real number n > 0. 
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Examples of constant scalar curvature metrics arise from the Einstein 
condition (see Lemma 6.7.7 below). 


Definition 6.7.6 (Einstein metric). A Riemannian manifold M is called 


an Einstein manifold iff its Ricci tensor is a scalar multiple of the first 
fundamental form, 1.e. there exists a smooth function \: M > R 


Ricp(u, v) = A(p) (u, v) (6.7.14) 


for allpe€ M and all u,v €T,M. It follows from the definitions that the 
factor X in (6.7.14) is related to the scalar curvature S by 


ee (6.7.15) 


m 


Lemma 6.7.2 shows that every Riemannian metric on a 2-manifold is an 
Einstein metric and the factor \ = K is the Gaufian curvature. 


Lemma 6.7.7. Let M be an Einstein manifold of dimension m > 3. Then 
the scalar curvature of M is locally constant. 


Proof. Let p € M. Then there exists a local orthonormal frame of the 
tangent bundle £1,..., £m € Vect(U) in a neighborhood U of p such that the 
covariant derivatives VE; all vanish at p. (Exercise: Prove this.) Denote 
the deriviative of a function f at p in the direction F;(p) by 0;f. Then 


0 = (We, R)(Ej, Bs) Be, B)) 


juke 
+ SO ((Va, R)(Ex, Ei) En, Ej) + )0((Ve,R)(Ei, Ej) Ex, Ej) 
ik ik 


= 0; DAR (E;, Ex) Ex, Ej) 


Ladi (Ex, Ej) Ex, Ej) + S— op 5” (R(Ei, Ej) Ex, Ej) 
kj 


a ) 0; Ric( Ej, E;) = ) Op Ric( Ei, Ex) 
j k 
—2 
aa | 
m 


Here the the first equality follows from the second Bianchi identity (6.3.7), 
the second holds at p because VE;(p) = 0, the third follows from the def- 
initions of Ric and S, and the last uses the identity Ric(E;, Ej) = 4i;5/m, 
which holds by (6.7.14) and (6.7.15). Since m > 3 it follows that the deriva- 
tive of S vanishes everywhere, and this proves Lemma 6.7.7. 
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Examples of Einstein metrics include all constant sectional curvature 
metrics by Theorem 6.4.8, the Fubini-Study metric on complex projective 
space, and all metrics with vanishing Ricci tensor. Examples of the lat- 
ter are Calabi-Yau metrics on complex manifolds and G»2-structures on 7- 
manifolds. These are again subjects that go far beyond the scope of this 
book. In general, the construction of Einstein metrics and the question of 
their existence is a highly nontrivial problem in differential geonetry. The 
study of this problem has a long history and there are many deep theorems 
and interesting open questions about this subject. 


6.8 The Weyl Tensor* 


This section introduces the Weyl tensor and explains some of its basic prop- 
erties. The section closes with brief discussions of locally conformally flat 
metrics and self-dual four-manifolds. 


Definition and Basic Properties 


Let m be a positive integer and let M C R” be an m-manifold. For each 
element p € M denote by R, : Tp>M x T,M — End(T,M) the Riemann cur- 
vature tensor and by Ric, : T,M x T,M — R the Ricci tensor of M at p. 
Also let S : Md —R be the scalar curvature in Definition 6.7.1. 


Definition 6.8.1 (Weyl tensor). Assume m > 3. The Weyl tensor of 
at an element p € M is the bilinear map 


Wy : TyM x TpM -+ End(T,M) 


defined by 
(W,(u, v)w, z) = (Rp(u, v)w, 2) 
- — (Ricp(v, w) {u, 2) — Ricp(u, w){v, 2)) 
— — (Ric (w, z)(v, w) — Ric,(v, z)(u, w)) a) 
= we =) ((u, z){v,w) — (v, 2)(u, w)) 


for u,v,w,z€TpM. 


Lemma 6.7.3 shows that the Weyl tensor vanishes in dimension three. 
In higher dimensions the Weyl tensor may be nonzero. The next lemma 
summarizes the basic algebraic properties of the Wey] tensor. 
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Lemma 6.8.2. The Weyl tensor W, :T,M x T,M — End(T,M) at an el- 
ement p € M is a skew-symmetric bilinear map with values in the space of 
skew-adjoint endomorphisms of T,M and, for allu,v,w,z € T,M and every 


orthonormal basis €1,...,€m of TpM, it satisfies 
(W,(u, v)w, 2) = (W,(w, z)u, v), (6.8.2) 
W,(u, v)w + W,(v, w)u + W,(w, u)v = 0, (6.8.3) 


SUACH OL Ee; = 0, (6.8.4) 
i=1 


Proof. The skew-symmetry of the Weyl tensor and equation (6.8.2) follow 
directly from the definition and Theorem 5.2.13. It then follows from (6.8.2) 
that W,(u,v) is a skew-adjoint endomorphism of T,M for all u,v € T,M. 
The verification of the Bianchi identity (6.8.3) is a straight forward compu- 


tation which we leave as an exercise. To prove (6.8.4), let u,v € T,M and 


choose an orthonormal basis €1,...,€m of T,M. Then 
SS" (Woles, uv, ei) = So (Roles, u)u, ei) 
i=1 i=l 
t yee. 
t= 3 Y (Riep(u, v) (ei, €4) — Ricp(e;, v)(u, e:)) 
i=1 
1 v/,. 
ee Y- (Ricpler €4)(U, uP) — Ries(u,&) en, v) 
i=1 
S(p) fio. o. oe: 
+ Gas ayGn aay Ly (evento) — (wer (ess2)) 
= Ric,(u, v) 
= a Ricp(u, v) 
S(p) 
a 5 ,v)+ 5 Ricp(u, v) 
S(p) 
m— gi) 
= 0. 


This proves (6.8.4) and Lemma 6.8.2. 


320 CHAPTER 6. GEOMETRY AND TOPOLOGY 


The Weyl Tensor in Local Coordinates 


Let 6: U > be a local coordinate chart on an open set U C M with val- 
ues in an open set 2 C R™, denote its inverse by w := ¢7! : Q > U and 
let E(x) := Ojb(x) € Tyz)M for c € QD andi =1,...,m be the be the local 
frame of the tangent bundle determined by this coordinate chart. Denote 
the coefficients of the first fundamental form by gi; := (£;, Ej), of the Ricci 
tensor by Ric;; := Ric(Ei, E;), and of the Riemann curvature tensor by Rf. ike 
Let S= 0; . Rici; 9 = 5. : Fang ‘I be the scalar curvature in local coor- 
dinates. Then the cofficients Wf. : Q > R of the Weyl tensor are defined 


ijk * 
by W(&, Ej)Er = doy Wi, Ee. and they can be expresses in the form 


Wijre = (W( Ej, Ej) Ex, Ee) = aM ijk Ive 


S > s. 
7 Rijngue + (m — 1)(m— 2) (siean = gix9je) (6.8.5) 
Vv 
1 . . 
a5 (Ricjusie — Ricizgje + Riciegjn — Ricjegix)- 


Conformal Invariance 


Definition 6.8.3 (Locally conformally flat metric). Let M be a Rie- 
mannian manifold. The metric g on M is called locally conformally flat, 
iff for each p€ M there exists a Riemannian metric on M that is confor- 
mally equivalent to g (see Definition 6.7.4) and flat in a neighborhood of p. 


By the local C-A-H Theorem 6.1.16 a Riemannian m-manifold M is lo- 
cally conformally flat if and only if each p € M has an open neighborhood U 
that is conformally diffeomorphic to an open subset 2 C R”, i.e. there 
exists a coordinate chart ¢: U > Q and a smooth function  : U > (0,00) 
such that |v| = A(p)|dd(p)v|R™ for all p € U and all v € T,M. 

A remarkable property of the Weyl tensor is that it remains unchanged 
under multiplication of the Riemannian metric by a positive function and 
so is an invariant of the conformal class of the metric. This is easy to see 
when the function is constant. In that case the Riemann curvature tensor 
and the Ricci tensor remain unchanged, the scalar curvature gets multiplied 
by the inverse, and so the Weyl tensor remains unchanged. As Remark 6.7.5 
shows, the situation is more complicated when instead of multiplying the 
metric by a constant, we multiply it by a nonconstant positive function. The 
conformal invariance of the Weyl tensor can then be proved by a somewhat 
cumbersome calculation in local coordinates. 
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Remark 6.8.4. This discussion shows that the Weyl tensor vanishes for 
every Riemannian metric that is locally conformally flat. In fact, it turns 
out that in dimension m > 4 the Weyl tensor of M vanishes if and only if 
the Riemannian metric on M is locally conformally flat (see [43]). More- 
over, a theorem of Kuiper [33, 42] asserts that every compact, connected, 
simply connected Riemannian m-manifold that is locally conformally flat is 
conformally diffeomorphic to the m-sphere with its constant sectional curva- 
ture metric. Thus every compact, connected, simply connected Riemannian 
manifold of dimension m > 4 that is not diffeomorphic to S™ must have a 
nonvanishing Weyl tensor. 


Self-Dual Four-Manifolds 


The lowest dimension in which the study of the Weyl tensor is interesting 
is m = 4. To explain this, it is useful to consider the notion of an oriented 
manifold. 


Definition 6.8.5 (Orientation). An orientation of an m-manifold M is 
a collection of orientations of the tangent spaces T,M, one for eachp € M, 
that depend continuously on p, i.e. if E,,..., Em are pointwise linearly inde- 
pendent vector fields in a connected open neighborhood U C M of p and the 
vectors E\(p),...,Em(p) form a positive basis of T,M, then for every q € U 
the vectors E\(q),...,Em(q) form a positive basis of TyM. In the intrinsic 
language an oriented manifold is one equipped with an atlas such that all the 
transition maps are orientation preserving diffeomorphisms. 


Definition 6.8.6 (2-Form). Let M be a smooth manifold. A 2-form on M 
is a collection of skew-symmetric bilinear maps w,:T,M x T,M — R, one 
for each p € M, which is smooth in the sense that for every pair of smooth 
vector fields X,Y on M the assignment p++ wp(X(p), Y (p)) defines a smooth 
function on M. The space of all 2-forms on M is denoted by 27(M). 


In the similar vein the Weyl] tensor of a Riemannian m-manifold M can 
be thought of as 2-form with values in the endomorphism bundle End(7'M). 
By the symmetry properties in Lemma 6.8.2 the Weyl tensor induces a linear 
map W : 0?(M) > 0?(M) via the formula 


(Ww) (u,v) = > (Wp(u, ves, €j)Wp(Ex, €;) (6.8.6) 
1<i<j<m 


for w € 0?(M), p € M, and u,v € T,M, where €1,...,@€m is an orthonormal 
basis of T,M. The right hand side of equation (6.8.6) is independent of the 
choice of this orthonormal basis and is a 2-form by Lemma 6.8.2. 
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Now let M be an oriented Riemannian 4-manifold. Then a 2-form w 
on M is called self-dual iff it satisfies the condition 


Wp(€0, €1) = Wp(€2, €3) (6.8.7) 


for every p € M and every positive orthonormal basis e€9, €1, €2, e3 of T,M. It 
is called anti-self-dual iff it satisfies (6.8.7) for every p € M and every neg- 
ative orthonormal basis €o, e1, €2, €3 of T,M . Thus w is anti-self-dual if and 
only if it is self-dual for the opposite orientation. Denote the space of self- 
dual 2-forms by 0?:+ (IM) and the space of anti-self-dual 2-forms by 0? (M). 
Then there is a direct sum decomposition 


0?(M) = 0?+(M) @ 0? (M). 


Lemma 6.8.7. Let M be an oriented Riemannian 4-manifold. Then the 
linear operator W : Q?(M) — 9?(M) in (6.8.6) preserves the subspace of 
self-dual 2-forms and the subspace of anti-self-dual 2-forms. 


Proof. Fix any orthonormal basis eo, €1, €2,€3 of T,M and abbreviate 
Wijk = (Wo Ci,-€7 en, 2) 
for i,j,k, € {0,1,2,3}. Then by Lemma 6.8.2 we have 
Wijke = —Wyike = —Wijek = Wreiz, Wijke + Wjkie + Weige =0, (6.8.8) 


woijo + Wiig + Waij2 + W3iz3 = 0 (6.8.9) 


for all i,j,k, @. It follows from (6.8.8) and (6.8.9) that 


W0102 = W2331; W0103 = W2312; W0203 = W3112; (6 8 10) 


W2302 = WO0131; W2303 = W0112; W3103 = W0212- 


Namely, by (6.8.8) each of these six identities is equivalent to an equation 
of the form yar Wijki = 0 with 7 # k and this holds by (6.8.9). Use (6.8.8) 
and (6.8.9) again to obtain 


W0101 — W2323 — Wo202 T W3131 


= W220 + W1221 + W3223 W0110 — W2112 W3113 
=0 


and hence, by cyclic permutation of 1, 2, 3, 


E := W0101 — W2323 = W0202 — W3131 = W0303 — W1212.- 
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Since woio1 + wo202 + W303 = 0 by (6.8.9), this implies 


BE = W221 + W332 + W1331 = W1221 — W0330 = W0303 — W1212 = E. 


Thus ¢ = 0 and so 


W101 = W2323; Wo0202 = W3131; W0303 = W1212- (6.8.11) 


Now assume that 7 is a nonzero self-dual 2-form and fix an element p € M. 
Then there exists a positive orthonormal basis eo, €1, €2,e3 of T,M and a 
real number A # 0 such that r(eo,e1) = T(e2,e3) = A and r(e;,e;) = 0 
for all other pairs i,j. Hence (Wr) (e;, e;) = Awo1ij + AW234; for all i and j. 
Take \ = 1 and use the equations (6.8.8), (6.8.10), and (6.8.11) to obtain 


(Wr)(€0, €1) = Wo101 + W2301 = W123 + W2323 = (WT)(e2, €3), 
(Wr) (eo, €2) = Wo102 + W2302 = Wo131 + We331 = (WT) (e3, €1), 
(Wr)(eo, €3) = wo103 + wW2303 = Wo112 + We312 = (Wr)(e1, €2). 


Thus W7 is self-dual. The anti-self-dual case follows by reversing the orien- 
tation. This proves Lemma 6.8.7. 


Lemma 6.8.8. Let M be an oriented Riemannian 4-manifold and denote 
by W : 0?(M) > 02(M) the linear operator determined by the Weyl tensor 
via (6.8.6). Then the following are equivalent. 


(i) If rT € 27(M) is anti-self-dual, then Wr = 0. 
(ii) The Weyl tensor W satisfies satisfies the equation 


W,(eo, €1) = W,(e2, €3) (6.8.12) 
for every p € M and every positive orthonormal basis eg, €1, €2,€3 of TpM. 


Proof. We prove that (i) implies (ii). Thus assume (i) and choose a positive 
orthonormal basis e€9, €1, €2,€3 of T,M. Let 7 be the 2-form defined by 


eg.bt 1, T(e2, €3) := —1 
and r(e;,e;) := 0 for all other pairs i,j. Then 7 is anti-self-dual and 
hence Wr = 0 by (i). Thus it follows from (6.8.2) and (6.8.6) that 


(W (eo, e1)u — W(eg, €3)u, v) = (Wr) (u,v) = 0 


for all u,v € T,M and this proves (ii). 
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Conversely, suppose that (ii) holds, choose a positive orthonormal ba- 
Sis €0, €1, €2,e3 Of T,M, and let 7 be an anti-self-dual 2-form. Then 


M1 = T(€0, €1) = —T(€2, €3), 
Az = T(€o, €2) = —T(e3, €1), 
A3 := T(e€9, €3) = —T(€1, €2) 


and hence 
Wr = A1((W (eo, 1) — W (ea, e3)):, -) 
+ A2(( (es, €1))-5°) 
+ A3(( (e1, €2))-,-) =0 
by (ii). This proves Lemma 6.8.8. 


(eo, €2 


WwW 
W (eo, €3 


)-W 
)-W 


Definition 6.8.9. An oriented Riemannian 4-manifold is called self-dual 
if its Weyl tensor satisfies the equivalent conditions of Lemma 6.8.8. It is 
called anti-self-dual iff it is self-dual for the opposite orientation. 


Examples of self-dual 4-manifolds are all 4-manifolds with constant sec- 
tional curvature by Theorem 6.4.8, or more generally all locally conformally 
flat 4-manifolds such as S$! x S?°. Other examples are the complex projec- 
tive plane with its Fubini-Study metric (Examples 2.8.5 and 3.7.5), and 
Ricci flat Kahler surfaces with the opposite of the complex orientation (the 
K3-surface and the Enriques surface). Compact simply connected smooth 
4-manifolds with signature zero that are not diffeomorphic to the 4-sphere, 
such as S? x S$? or the one-point blowup of the projective plane, do not 
admit any self-dual metrics by Kuiper’s theorem (Remark 6.8.4), because 
every self-dual metric on such a manifold is locally conformally flat. For a 
survey of these basic examples see Kalafat [34]. 

The study of self-dual 4-manifolds was initiated by Penrose [56] and 
Atiyah—Hitchin-Singer [4] and was later taken up by many authors including 
Taubes [73], LeBrun [45], and Donaldson [18]. 

In [18] Donaldson proposes to study the moduli space of self-dual metrics 
on a compact oriented smooth 4-manifold (without boundary) modulo the 
action of the group of diffeomorphisms. This is a very difficult problem. The 
self-duality equation is a system of nonlinear partial differential equations 
and they do give rise to a “finite-dimensional moduli space”. However, this 
space may be highly singular and it is noncompact unless it is empty. It 
would need to be “compactified” in a suitable way, one would need to find a 
way to understand the singularities, and one would have assign to it some 
kind of “virtual fundamental class” to obtain numerical invariants by pairing 
this class with suitable cohomology classes in the ambient space. 


Chapter 7 


Topics in Geometry 


This chapter explores various topics in differential geometry that are central 
to the subject and accessible with the material covered in this book, but go 
beyond the scope of a one semester lecture course. The first section builds 
on the material in Chapter 4 about geodesics and can be read directly after 
that chapter. It introduces conjugate points on geodesics and proves the 
Morse Index Theorem. This result is then used to show that every geodesic 
without conjugate points minimizes the length locally (and strictly) in the 
space of all curves joining the same endpoints and, conversely, that locally 
minimizing geodesics have no interior conjugate points (§7.1). This result in 
turn plays an essential role in the proof of the Continuity Theorem for the in- 
jectivity radius (§7.2). The next section examines the group of isometries of 
a connected Riemann manifold and contains a proof of the Myers-Steenrod 
Theorem, which asserts that the isometry group always admits the natural 
structure of a finite-dimensional Lie group (§7.3). The proof given here 
has several parallels to the proof of the Closed Subgroup Theorem. This 
section is based on the study of isometries and the Riemann curvature ten- 
sor in Chapter 5 and can be read directly after that chapter. The present 
chapter then deals with the specific example of the isometry group of a com- 
pact connected Lie group equipped with a bi-invariant Riemannian metric 
(§7.4). The last two sections are devoted to Donaldson’s differential geo- 
metric approach to Lie algebra theory [17]. They build on the material in 
Chapter 6 and include Donaldson’s existence theorems for critical points of 
convex functions on Hadamard manifolds (§7.5) and his existence proof for 
symmetric inner products on simple Lie algebras (§7.6). Corollaries include 
the uniqueness of maximal compact subgroups, and Cartan’s theorem about 
the compact real form of a semisimple complex Lie algebra. 
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7.1 Conjugate Points and the Morse Index* 


This section introduces conjugate points on geodesics and contains a proof 
of the Morse Index Theorem. As an application we prove that every geodesic 
without conjugate points minimizes the length and the energy locally (and 
strictly) among all nearby curves joining the same endpoints. Conversely, 
locally minimizing geodesics have no interior conjugate points. The results 
of this section will be used in §7.2 to prove the Continuity Theorem for the 
injectivity radius. Assume throughout that M@ C R” is asmooth m-manifold 
and recall the notation Q,,, for the space of all smooth paths ¥ : [0,1] > M 
with the endpoints y(0) = p and y(1) = q (84.1.2). 


Conjugate Points 


We have seen in Theorem 4.4.4 and Corollary 4.4.6 that geodesics minimize 
the length on short time intervals. A natural question arising from this 
observation is how long the time interval can be chosen on which a geodesic 
minimizes the length at least locally in some neighborhood in the space of 
paths. An answer to this question is closely related to the Hessian of the 
energy functional EF : Qp,. + R in (4.1.2). It was established in Lemma 6.6.3 
that the Hessian of the energy functional FE at a geodesic ¥ : [0,1] > M is 
the linear operator H., : Vecto(y) — Vect(y) defined by 


HX := —VVX — R(X,4)¥. (7.1.1) 


The domain of this operator is the space Vecto(y) of all vector fields X 
along y that vanishes at the endoints, i.e. X(0) = 0 and X(1) = 0. Recall 
that a Jacobi field along a geodesic is a solution of the equation 


VVX + R(X,4)7 =0, (7.1.2) 


and so the kernel of H, is the space of Jacobi fields along y that vanish at 
the endpoints (Lemma 6.1.17). 


Definition 7.1.1. Let 7: [a,b] + M be a geodesic. A conjugate point 
is a real number 7 in the interval a < 7 < b such that there exists a non- 
vanishing Jacobi field X along the restriction | [a,x] that satisfies X(a) =0 
and X(t) =0. The dimension of the space of solutions of this equation is 
called the multiplicity of the conjugate point rt and will be denoted by 


VVX + R(X, 47 =0, 


ty) = aim { X € Vect(Y|ja,7]) Nay 0 i) 0 } 6 (MS) 


Thus m,(T) = 0 whenever T is not a conjugate point of y. 
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In §4.4 we have addressed the question when a geodesic ¥ : [0,1] + M 
with the endpoints y(0) = p and 7(1) = q minimizes the lengths of curves 
with the same endpoints globally, i.e. when it satisfies L(y) = d(p,q). A 
weaker variant of this question is whether it minimizes the length locally, 
i.e. only among nearby curves in 92,4. Here the word “nearby” can have 
several meanings, depending on which topology on the space 2, is used. 
The relevant topologies in the present setting are the C! topology and the 
C@ topology. Since our manifold M is embedded in R” these topologies are 
induced by the distance functions defined by 


doi(7,7') = sup |y(t) -— 7+ sup |¥(¢) — 4/(), 
0<t<1 0<t<1 
(7.1.4) 


k 
Ps Sook supocr<il ge (7(t) — 7/4) 
co ; ae ke 

hao) )©— + 8UPp<i<i| (y(t) = ¥'(t))| 
for y,7' € Qpq. Note that, if y € Qpq satisfies min,|+(t)| > 0, then so does 
every curve 7’ in a sufficiently small C!-neighborhood of ¥ in Dg: 
Theorem 7.1.2. Let y: [0,1] — M be a nonconstant geodesic with the end- 
points y(0) = p and y(1) = q. Then the following holds. 
(i) If there exists a C™®-open neigborhood UC Qyq of y such that every 
curve y' EU satisfies L(y’) > L(y), then y has no conjugate points T in the 
open intervalO <7 <1. 
(ii) If y has no conjugate points rT in the interval 0 <7 <1, then there ex- 
ists a C!-open neighborhood U C Qyq of y such that every curve 7/ € U 
satisfies L(y’) > L(y) with equality if and only if there exists a diffeomor- 
phism p: [0.1] + [0,1] such that p(0) = 0, p(1) =1, and 7’ =yoop. 
Proof. See page 332. 


Example 7.1.3. The archetypal example of a conjugate point is the end- 
point of a geodesic y : [0,1] > S? on the unit sphere S? C R® of length 7. 
This is the endpoint of an arc around half of a great circle, and at this point 
the geodesic seizes to be locally unique as there is a continuous family of 
geodesics joining north and south pole (the meridians). 


Example 7.1.4. The absence of conjugate points does not signify that a 
geodesic y € Qp,q minimizes the length globally. The sinplest example is 
a geodesic around the unit circle M = $1 of length L(y) > 7. Similar 
examples exist on the torus M = T™, on the cylinder M = R x S', and 
also on simply connected manifolds such as the sphere S$? with a suitably 
chosen Riemannian metric (so that an open subset U C S? is isometric to a 
cylinder (0,1) x $+, for example). 
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The Morse Index Theorem 


Definition 7.1.5 (Morse index). Let 7: [0,1] + M be a geodesic. The 
Morse index of y is defined as the maximal dimension of a linear sub- 
space X C Vecto(y), such that 


1 
/ (IVXP — (R(X,4)9,X)) dé <0 (7.1.5) 


for every nonzero element X € X. It will be denoted by 


Xx is a linear subspace of Vecto(X) (7.1.6) 
and (7.1.5) holds for all X © X¥\ {O} fo <7" 


p(y) = max { aim(%) 


This is also the number of negative eigenvalues, counted with multiplicities, 
of the operator Hy in (7.1.1). 


Theorem 7.1.6 (Morse Index Theorem). Let 7 : [0,1] ~ M be a 
geodesic. Then y has only finitely many conjugate points T, each with multi- 
plicity 1 <m,y(T) <‘m, and the Morse index of y is the number of conjugate 
points in the open interval 0 <7 <1, counted with multiplicities, t.e. 


wy) = D7 my(7). (7.1.7) 


0<T<1 


Proof. See page 330. 


Exercise 7.1.7. Prove that geodesics on manifolds with nonpositive sec- 
tional curvature have no conjugate points. So their Morse indices are zero. 


Exercise 7.1.8. Find geodesics on the unit sphere M = S$? C R?® with 
arbitrarily large Morse index. 


The proof of Theorem 7.1.6 requires some background in functional anal- 
ysis which goes beyond the scope of this book and for which the interested 
reader is referred to [12]. Remark 7.1.9 below lists the main properties of 
the operator H, that are used in the proofs of Theorems 7.1.2 and 7.1.6. 
It will be convenient in some places to use the L? inner product as 


1 
(X,Y) p2 =| (X(t), Y(t)) dt 


and the associated norm 


1 
[Xllo0= / X(t)? dt 


for vector fields X,Y along ¥. 
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Remark 7.1.9 (Properties of H.,). We begin with the properties that 
can be easily verified directly, without an appeal to Hilbert space theory. 
(a) Each eigenvalue of H, is a real number. 

This assertion is a consequence of the fact that the operator H., is symmetric, 
ie. (HyX,Y) 72 = (X,H,Y) 72 for all X,Y € Vecto(y). (Exercise: Verify 
assertion (a) by complexifying the space of vector fields along y.) 

(b) Each eigenvalue of Hy has multiplicity at most m. 

The eigenspace of H., for an eigenvalue \ consists of solutions of the equa- 
tion —VVX — R(X,7) =AX with X(0) =0. Every solution is uniquely 
determined by V_X (0) and is an eigenfunction for if and only if X(1) = 0. 
(c) Every X € Vecto(y) satisfies the Poincaré inequality 


1 1 
| IX(t)/ < =f IW. 2 de. (7.1.8) 
0 0) 


To see this, choose a parallel orthonormal frame Fj(t),...,Em(t) € Ty)M 
of the tangent bundle along y and write X(t) = )>)", &(t)E,(t). Then the 
inequality (7.1.8) takes the form 7? Solée at < Sole? at and this can be 
proved by writing € as a Fourier series €(t) = )°7°., sin(wkt)€,. (Exercise: 
Prove the estimate (7.1.8). Verify that it is sharp.) 


(d) Let cy := sup, supp,j=1(R(v, y()) (4), v).. Then, for all X € Vecto(7), 


[ (wx? - (a0%,994.%)) a> 0) fIxPat 


This follows directly from the Poincaré inequality in (c). 

(e) Let A be an eigenvalue of Hy. Then A > 1? — cy. 

There exists a nonzero element X € Vecto(y) such that HyX = AX. By (d) 
this implies \||X||?2 = (X,H,X) ;2 > (x? — c,) ||X|lZ2 and so A > 1? — c,. 
(f) The set o(H,) of eigenvalues of H, is a discrete subset of R. 

Let A € o(H,) and let II, be the orthogonal projection onto the eigenspace 
of A. Then the operator H, — Aid + II) : Vecto(y) — Vect(y) is bijective, 
and hence so is the operator H.,—X’id+TI) for X’ sufficently close to A. Such 
a number ’ cannot be an eigenvalue of Hy. This argument uses the fact 
that 1 is a Fredholm operator between appropriate Sobolev completions. 
(g) The smallest eigenvalue of Hy is the supremum of all real numbers a 
that satisfy the inequality 


[ (wx = (eX; 4, X)) dt > af ixoat (7.1.9) 


for all X € Vecto(y). 
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To prove this, define a € R to be the infimum of the integrals on the left 
in (7.1.9) over all X € Vecto(y) with ||X||;2=1. This infimum is finite 
by (d). Now choose a minimizing sequence X; € Vecto(y) with ||X;||,2 = 1. 
This sequence satisfies a uniform upper bound on ||V.X;||;2. Hence, by the 
theorems of Banach—Alaoglu and Arzela-Ascoli, there exists a subsequence 
that converges weakly in the Sobolev space W!? and strongly in the supre- 
mum norm to a weak solution of the equation -VVX — R(X,7)V = aX 
with zero boundary condition. Now one can verify by a bootstrapping ar- 
gument that every weak solution is smooth. Hence (7.1.9) holds, a is an 


eigenvalue of H,, and every eigenvalue \ of 1, satisfies \ > a. 


(h) The Morse index of y is finite. 

The operator H, has only finitely many negative eigenvalues by (e) and (f), 
and the direct sum Vy of their eigenspaces is finite-dimensional by (b). More- 
over, every nonzero element X € %, satisfies the inequality (X,H,X)p2 < 0. 
One can then repeat the argument sketched in (g) for the L? orthogonal 
complement of 4, to show that (X,H,X) 2 > 0 for all X € AG Hence the 
dimension of 4, is the Morse index of y (Definition 7.1.5). 


Proof of Theorem 7.1.6. Choose a parallel eT ee 
F,(t), ...; Em(t) € Ts)M 
and, for 0 < t < 1, define the symmetric matrix S(t) = S(t)’ € R™*™ by 
S(t) = (Sig(t))ija1, Sig (t) == (R(Ex(t), 1) 9), Ey), 
Let I := [0,1] and define the operator A : Co°(/,R™) > C™°(U.R™) with 
the domain Cf°(J, R™) := {€ € C(I, R™) | €(0) = €(1) = 0}, by 
A€é := —€ — SE. 


This operator is isomorphic to the operator (7.1.1) via the isomorphism that 
sends € € Co°(I, R™) to X := 50, & E; € Vecto(y). Now define a family of 
operators A, : Co°(I,R™) — C*®(I.R™) by 


(ArE) (8) = E(t) — S(rH EC) 
for 0<7<1. Then A, is isomorphic to the operator (7.1.1) on the in- 
terval [0,7] via the isomorphism that sends € € C9°(J,R™) to the vector 
field X(t) := 0, €(7 ")E,(t) € Ty1)M, O0<t <7, along the curve y\j,7)- 
Thus 7 is a conjugate point of y if and only if A; has a nontrivial kernel, 
and the dimension of the kernel is the multiplicity m,(z). 
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By Remark 7.1.9 A, has only positive eigenvalues for small positive 7. 
For 0 < 7 < 1 define the operator A; : C§°(J, RR”) > C(I, R™) by 


(A,€)(t) = S(4,)(4) = &e) — a8 (rH E(). 


If 7 is a conjugate point, we claim that every element € € ker(A;,) satisfies 
1 . 
TH) = fe Aré) dt = - SOP. (7.1.10) 


To see this, note that €(t) + 7?.S(rt)€(t) = 0 and 4 9(rt) = 7$(rt). Hence 


1 on . 
ro) =f (Gee. ~ Hele). rS(rHE)) a 
vl 
= [ (GeO) + HEH, S(rE—) + EW, SlrH}E(e)) a 
i 


73 
i d fs 

=- | 5 (Her) a 
1. 

= - 60)? 


This proves (7.1.10). Note also that T,(€) < 0 unless €(t) = 0 for all t. 

With this understood, we appeal to the Kato Selection Theorem [35, 
Theorem II.5.4 and Theorem II.6.8]. It asserts in the case at hand that, 
near each point 7 with k := dim(ker(A,,)) > 0, there exist k continuously 
differentiable functions A; : (7| — €,7 +¢€) > R such that ;(7) = 0, the 
numbers \1(7),...,A“(7) are the eigenvalues of A, near zero, with mul- 
tiplicities accounted for by repetitions, and the derivatives di(T0) are the 
eigenvalues of the crossing form I’,, : ker A,, + R in (7.1.10), again with 
multiplicities accounted for by repetitions. The derivatives are all negative 
by (7.1.10). Hence there exists a 6 > 0 such that A;(7) > 0 for ™—d < T < 7 
and A;(T) < 0 for 7 <7 <7+06. Hence conjugate points are isolated, and 
for 7] < T < 7+ 6 the operator A, has precisely k more negative eigenvalues 
than for T — 6 < 7 < 7. Hence the number of the negative eigenvalues of A; 
with multiplicities is }{9---, dim(ker(A-,)). This proves Theorem 7.1.6. 


For more detailed explanations and other closely related applications of 
the Kato Selection Theorem the reader is referred to [61, 76]. 
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Locally Minimal Geodesics 
Proof of Theorem 7.1.2. We prove part (i). Let X € Vecto(M) and define 
s(t) := expt) (sX (¢)) 


for 0 <t < 1 and —éd < s < 6. Here 6 > 0 is chosen so small that sX(t) 
is contained in the domain V4) C Ty@)M of the exponential map for all t 
and all s with |s| <6. Then lims_,9dc~(7,7s) =0 and so ys €U for s 
sufficiently small. Hence the function (—d,6) > R: s + L(ys) has a local 
minimum at s=0, and since E(y) = L(y)?/2 and L(ys)?/2 < E(ys), so 
does the function s ++ E(y;). By Lemma 6.6.3 this implies 

d2 


0 B(ys) = / *(|WXP = (R(X, 4)4,.X)) at 


i ee 
_ 2 
ds s=0 


This inequality shows that the operator H, has no negative eigenvalues. 
Hence, by Theorem 7.1.6 the geodesic y has no conjugate points 7 in the 
interval 0 <7 <1. This proves (i). 

We prove part (ii) in six steps. Let y € Qp be a nonconstant geodesic 
without conjugate points. 


Step 1. There exist constants 6, >0 and e > 0 such that 


[ (wx — (R(X,4/)¥,X)) at > ef (Iwxor +IX()P) dt (7.1.11) 


for every curve 7! € Qyq with dci(y,7) < 61 and every X € Vecto(7’). 


By Theorem 7.1.6 the Hessian H, has no negative eigenvalues and, since 1 
is not a conjugate point of y, also zero is not an eigenvalue of H,. Hence 
the smallest eigenvalue of H., is positive and so, by part (e) of Remark 7.1.9, 
the estimate (7.1.9) holds for all X € Vecto(y) with a positive constant a. If 
the constant 6 > 0 is chosen such that (R(v,7(t))4(t), v) < b|v|? for all t and 
all v € T,q)M, we obtain the estimate (7.1.11) for 7 = y and X € Vecto(7) 
with ¢ :=a/(a+b+1). (Exercise: Verify this.) In a local coordinat chart 
on M the integrand on the left in (7.1.11) has the form 


So (E+ rhe) ane (E+ DIP) — YT Rignetichetel, 
ke ij LV ij ke 

where c(t) := $(9/(#)) and €(t) = dd(7’(t))X (t) (and the coefficients gj,, rk, 

and R;j;,¢ are functions of c(t)). This expression depends continuously on the 

curve c and its derivative é. Hence there exists a constant 6; > 0 such that 

every curve 7/ € Qp4 with doi(y,7') < 61 and every X € Vecto(7’) satisfies 


the estimate (7.1.11) with « = a/2(a+b+1). This proves Step 1. 
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Step 2. Let 6, > 0 be as in Step 1. Then there exists a constant 62 > 0 with 
the following significance. If Y € Vecto(7) satisfies 


sup (iv@| ee IVY (t)|) 2 by, (7.1.12) 
0<t<1 


then the curve 7 := exp,(Y) € Opq satisfies dci(y, 7’) < 61. 
For 0 <¢ <1 define the map ft : Via) x T,)M > TM by 


felv, 8) = (exp((0), 0), dexp((0),¥) (4(4),8 + hyn (GO), v))) (7.1.13) 


for v,v€ T,y)M. By Lemma 4.3.6 the map (t,v,v) > fe(v,0) from the 
set U := {(t,v,0)|0<t <1, uv EV, 0 € T,)M} to TM is smooth and it 
satisfies f,(0,0) = (y(t), 7(t)). Thus there exist constants rj > 0 and C, > 1 
such that all ¢ and all v,0 € T,q)M with |v| + |v] <1 satisfy v € V,q) and 


P—Ol+P- WH) <SCi(lol +l), = f,0). (7.1.14) 


Choose Y € Vecto(y) such that |Y(t)|+|VY(t)| <71 for 0 < t < 1 and 
define 7'(t) := exp,)(Y(¢)). Then it follows from the Gau8—Weingarten 
formula in Lemma 3.2.3 that (7 (t),*/(t)) = fi(Y (¢), VY (0) for all t. Hence 
the estimate (7.1.14) shows that Step 2 holds with 62 := min{r1, 61/C;}. 


Step 3 Let 62 > 0 be as in Step 2. Then there exists a constant 63 > 0 with 
the following significance. If y' € Qyq satisfies the inequality dci(y,7') < 63, 
then there exists a unique vector field Y € Vecto(y) satisfying (7.1.12) and 


YW] <i); M), 1H) =expyn(¥(O) (7.1.15) 
for0O<t<1l. 
Define p := info<¢<1 inj(y(t); M) > 0 and let f; be the map in (7.1.13). 
Its derivative at the origin is given by df:(0,0)(n,7) = (7,7 + hy (F(4), 0) 
and so is invertible. Hence, by the implicit function theorem, there exist 
constants r2 > 0 and C2 > 1 such that, for 0 <¢ < 1, the set 


Qt = {(p,P) € TM | |p— y(t) + |P- 4 < ra} 


is compact, f; restricts to a diffeomorphism from a compact neighborhood 
of the origin in B,(y(t)) x Ty@)M to Q;, and every pair (p,p) € Q; satisfies 


e+ [81 <Co(p—yOl+-VOl), 3) = FB). (7.1.16) 


Choose 7 € Qp,q such that doi(y,7) < ra. Then (7(t),7(¢)) € Q: for all t 
and hence there exists a unique vector field Y € Vecto(y) satisfying (7.1.15). 
By (7.1.13) it also satisfies (Y(t), VY (t)) = f7'(9'(t), (6) for all t. Hence 
the estimate (7.1.16) shows that Step 3 holds with 63 := min{ra, d2/C2}. 
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Step 4. Let 63 be as in Step 3. Then there exists a constant 64 >0 with 
the following significance. If y/ € Qpq satisfies dci(y,7) < 64, then the 
map p: [0,1] > [0,1] defined by p(t) := L(y’)7! ly (s)| ds for0<t<1 is 
a diffeomorphism and dcu(y,7y' ° p+) < 63. 

Define c := L(y) and C := sup,|¥(t)|. Then c > 0 because 7 is nonconstant. 
We claim that Step 4 holds for every constant 64 > 0 that satisfies 


12 
ba < (s2 # =) ea (7.1.17) 


To see this, fix a constant 64 > 0 that satisfies (7.1.17) and let 7’ € Qp,q such 
that dcoi(y, 7’) < 64. Since the geodesic y is parametrized by the arclength, 
it satisfies |+(t)| = L(y) =c. Hence ||4’(t)| — c| < 64 for all t. Since 64 < c 
by (7.1.17), this implies that p is a diffeomorphism. It also implies the length 
inequality |L(y’) — c| < 64 and hence 


<a 


Define 8 := p—! and 7" := 7 0 B. Then 8(p(t)) = 1/A(t) and hence 
1—- 12 
_ [=A . 2 


: 264 464 
3 t)-1 : 
<3, [p< <= 


|B(o(t)) — 1| At) : 
This implies 
BQ -1< 4, a-< 4, scr. 
Hence 
ly(t) — ¥"()| < ly — 16) + ly(2@) - 76) 
< elt — BY) +daly,7) 
< 1364 
and 


lH) -— 7 @1 < FO — 18H) + 11 - BOIMEM)!| 
+ BEBE) - ¥(6@)| 
< Olt — B(t)| + ¢|1 — B(t)| + B@)der (7, 7’) 
< (25 +1247) 4. 


These inequalities imply dcoi(y, 7") < (12C/c + 32)64 < 63 by (7.1.17) and 
this proves Step 4. 
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Step 5. Let 63 > 0 be as in Step 3 and let y' € Qyq such that dci(y, 7’) < 63. 
Then E(9’) > E(y) with equality if and only if 7 = y. 
Assume yy’ # 7. By Step 3 there exists a nonzero vector field Y € Vecto(y) 
satisfying (7.1.12), (7.1.15), and 7’ = exp,(Y). For 0 < s,t < 1 define 
s(t) = exp,z)(s¥ (¢)). 
Then yo = 7, 71 = 7’, and Step 2 asserts that 
doi(7, 7s) < 61 forO<s<1. 

Hence it follows from the argument in the proof of Lemma 6.6.3 that 

d2 

ds? 
for all s. Here the equality uses the identity V,0.7; = 0 and the inequality 
follows from Step 1 and the fact that 0,7; = dexp,(sY)Y is a nonzero vector 
field along y;. Thus the curve [0,1] > R: s+ E(4,) is strictly convex. Since 


its derivative vanishes at s = 0, it follows that E(7;) > E(y) for0<s <1. 
Since 7; = 7, this proves Step 5. 


Step 6. Let 54 be as in Step 4. Then the C!-open set 
U = {7 € M9 | dor(y,7') < 4} 
satisfies the requirements of part (ti) in Theorem 7.1.2. 


Let y/ €U and define the diffeomorphism p: [0,1] > [0,1] as in Step 4. 
Then, by Step 4 we have doi(y7,7/ 0 p7!) < 63 and hence, by Step 5 this 
implies E(7' 0 p~!) > E(y) with equality if and only if 7/0 p-! = 7. Since y 
and 7/0 p~! are parametrized by the arclength, we find that 


L(y’) = L(/ 0 p™*) = V2E(/ 0p?) > V2E(y) = Ln), 
1 


1 
Be) =f (|Reavral? — (RO rre,8e7-)2ere, 8x74) at > 0 


and that equality holds if and only if 7/ o p~* = y. Thus every curve 7/ € U 
that arises from y by reparametrization has the same length as y and every 


other curve in U is strictly longer. This proves Step 6 and Theorem 7.1.2. 


We remark that there is a precise analogy between Theorem 7.1.2 about 
geodesics (extrema of the energy functional £) and extrema of a function 


f:R° SR 


on a finite-dimensional vector space. If the function f has a local minimum 
at a point x9 € R”, then the Hessian of f at xo has no negative eigenvalues 
and, conversely, if xo is a critical point of f and all the eigenvalues of the 
Hessian at xo are positive, then zo is a strict local minimum of f. 
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7.2. The Injectivity Radius* 


Assume throughout that MM CR” is a smooth m-manifold. In this section 
we prove that the function which assigns to each point p € M its injectivity 
radius inj(p; M) € (0, 00] is continuous. Recall the concept of a local diffeo- 
morphism as a smooth map between manifolds of the same dimension whose 
derivative at each point is a vector space isomorphism (Definition 2.2.20). 
Recall also the notation V, C T,M for the domain of the exponential map 
at p and the notation B,(p) = {v € T,M ||v| <r} for pe M andr > 0. 


Theorem 7.2.1. The function M — (0,00] : p+ inj(p; M) is continuous. 


Proof. See page 339. 


The proof of Theorem 7.2.1 is based on two lemmas. 


Lemma 7.2.2. The set U; := {p€ M|r < inj(p; M)} is open for each real 
number r > 0. 


Proof. Let r >0 and po € U;. We prove in three steps that there exists 
a 6 > 0 such that r+ 6 < inj(p; M) for every p € M with d(p,po) < 0. 


Step 1. There exists ad >0 such that B,+5(p) C Vp for every pe M 
with d(p,po) < 6. 


Suppose, by contradiction, that such a constant 6 does not exist. Then 
there exist sequences p; € M and vu; € Tp, M \ V; such that d(pi,po) < 1/i 
and |u;|< r+ 1/7. Passing to a subsequence, if necessary, we may assume 
that vj; converges to a vector v9 € Tp, M. Then |vo| < r and so (po, vo) € V. 
Since (p;,v;) € TM \ V converges to (po, v9), this contradicts the fact that V 
is open. This proves Step 1. 


Step 2. There exists a 6 > 0 such that the map exp, : B,+5(p) + M is a 
local diffeomorphism for every p € M with d(p, po) < 6. 


Suppose, by contradiction, that such a constant 6 does not exist. Then 
there exist sequences pj € M and uy; € T,,M such that d(p;,po) < 1/i and 
|u;| < r+ 1/i, and the derivative dexp,,(vi) : Tp, M — Texp,, (vi) is not 
injective. Passing to a subsequence, if necessary, we may assume that v; 
converges to a vector Ug € T,,M. Then, by smoothness of the exponential 
map, the derivative 


dexp,, (00)? Loy = Tes. (yt 


EXPpy 


is not injective. Since |vo| <r, this contradicts the fact that r < inj(po; M). 
This proves Step 2. 
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Step 3. There exists a 6 >0 such that the map exp, : B,+5(p) > M is 
injective for every p € M with d(p,po) < 6. 


Suppose, by contradiction, that such a constant 6 does not exist. Then there 
exist sequences pj € M and u;, vu; € Tp, M such that 
d(p;, po) < 1/2, Jus] <<r+1/i, ju|<rt+1/i 
and 
Ui F Ui, €XPp, (Ui) = EXPp, (Vi) = Gi- 


Passing to a subsequence, if necessary, we may assume that the limits 
uo = lim wu; € T,,M, vo := lim vy; € Tp,M 
Ioo 100 
exist. These limits satisfy 
Juo| <1, luo] <1, €XP,, (Uo) = EXP, (vo). 


Since r < inj(po; M), this implies up = vp and hence 


lim u; = lim v; = vo. 
100 1-00 


Now define 


Ui Uy 
Wipi= — € 1M, Tj = |vi — ui| > 0. 


a 


Passing to a further subsequence, we may assume that the limit 
wo := lim w; € Tp) M 
1 CO 
exists. Then |wo| = 1 and hence 


0= lim Xp, (u;) — EXPy, (us) 


1-00 Tj 


This contradiction proves Step 3. Now choose a constant 6 > 0 that satisfies 
the requirements of Step 1, Step 2, and Step 3. Then, for every pe M 
with d(p,po) < 6 we have B,+5(p) C Vp and the map exp, : B,+5(p) + M 
is an diffeomorphism onto its image. Hence inj(p; M) > r+ 06> r for every 
element p € M with d(p,po) < 6, and this proves Lemma 7.2.2. 
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Lemma 7.2.3. The set A, := {p € M |r < inj(p; M)} is closed for each real 
number r > 0 and hence also for r = oo. 


Proof. Let p; € A; be a sequence that converges to an element p € M. We 
prove in five steps that r < inj(p; /). 


Step 1. B,(p) C Vp. 


Choose a tangent vector v € T,M with |v| < r, choose a constant ¢ > 0 such 
that |v| + © <r, and choose an integer i such that d(p,p;) < ¢. Define 


k= exPp, (Bioj+e (Pi) = {q EM | d(q,pi) < |v] + e} : 
Here the second equality follows from Theorem 4.4.4 and the fact that 
jul t+e<r<inj(p;;M). By definition, Kk is the image of a compact set 
under a continuous map, and so is a compact subset of M/. Hence 


K := {(q,w) €TM | d(q,pi) < |v] +, wl = lol} 
is a compact subset of 7’. We claim that v € V,. Suppose, by contradic- 


tion, that this is not the case. Then J, (0,00) = [0,T) withO<T <1. 
Denote by y : [0,T’) + M the geodesic y(t) := exp,(tv). Then 


d(y(t), pi) < dlexp,(tv), p) + d(p, pi) < [to] te < lol +e 


and |¥(t)| = |v|, and hence (y(t),4(t)) € K for 0<t<T. By Lemma 4.3.3 
and Corollary 2.4.15, this implies that there exists a constant 6 > 0 such 
that [0,7 +6) C Ip», in contradiction to the definition of T. This contra- 
diction shows that our assumption that v is not an element of V, must have 
been wrong. Thus v € V, and this proves Step 1. 


Step 2. Letq € M such that d(p,q) <r. Then there exists a vector v € T,M 
such that |v| = d(p,q) and exp,(v) = q. 


Since p; converges to p, we have lim;...d(pi,q) = d(p,q) < r and so 
there exists an integer ig € N such that d(pi,q) <r for all i> io. Then, 
for each i > io, there exists a tangent vector v; € T,,M that satisfies the 
conditions exp,,(vi) = q and |v;| = d(pi,q) < r (Theorem 4.4.4). Passing to 
a subsequence, if necessary, we may assume that the limit v := limj_-,. uj 
exists in R”. Then v € T,M, |v| = limy+.o|v4| = limjyoo d(pi, g) = d(p, 9), 
so v € B,(p) C Vp and exp,(v) = limi. exp,, (vi) = g. This proves Step 2. 
Step 3. Let v €T,M such that |v| <r. Then d(p, exp,(v)) = |u|. 

Define g := exp,(v), so d(p,q) < |v] < r. Choose a sequence v; € Ty, M such 
that |v;| = |v] for all 7 and lim;_,~. v; = v. Then the sequence q; := exp,,(v;) 
converges to q = exp,(v) and satisfies d(p;,q:) = |v;| by Theorem 4.4.4. 
Hence d(p, ¢) = limj+o0 d(p;, G) = limj_-+o0|v;| = |v| and this proves Step 3. 
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Step 4. The map exp, : B-(p) + M is a local diffeomorphism. 


Let v € B,(p) and let 0 € T,M be a nonzero vector. Choose any real num- 
ber 1 < A < r/|v| so that |Av| < r and define the geodesic ¥ : [0, A] + M and 
the vector field X € Vect(7) by 


0 es Pa 
y(t) := exp, (tv), AG) = 7s exp, (¢(v + sv)) = dexp,(tv)t® 
s=0 
for 0 <t< A. By Lemma 6.1.17, X is a Jacobi field along yy and 
X(0) = 0, VxX(0) =0 70, X(1) = dexp,(v)v. 


Since L(y) = d(p,exp,(Av)) = d(y(0),y(A)) by Step 3, it follows from 
part (i) of Theorem 7.1.2 that the geodesic y : [0,A] + M has no con- 
jugate points 7 in the open interval 0 < 7 < 4. In particular, 7 = 1 is not 
a conjugate point, and so X(1) 40. Hence the derivative dexp,(v) of the 
exponential map is bijective at every point v € B,(p) and this proves Step 4. 


Step 5. The map exp, : B-(p) + M is injective. 


This is a covering argument. Let vo,vi € B,(p) with exp,(vo)) = exp(p1), 
choose a smooth path v : [0,1] > B,(p) such that v(0) = vp and v(1) = v4, 
and define 7(t) := exp,(v(t)) for 0 < t <1, so (0) = y(1) = g := exp, (vo). 
Choose p < r and i € N such that |v(t)| < p for all t and d(p;,p) < r—p. 
Then d(p;,y(t)) < d(pi, p) + d(p, y(t)) < d(pi,p) + p < r for all t. Define 


B(s,t) = expp, (sexpy,'(@) + (1-s)exp,(7(®)), OS 8, <1. 
This map takes values in the set U,.(p) = {p’ € M|d(p,p’) < r} and satisfies 


B(0,t) = y(t) =exp,(ot)), BUH =¢q for0<t<1, 


and 6(s,0) = B(s,1) = q for all s. Since the map exp, : B,(p) — U,(p) 
is surjective by Step 2 and a local diffeomorphism by Step 4, a path lifting 
argument shows that there exists a smooth map u : [0,1]? + B,(p) such that 


u(0,£) = u(é), exp,(u(s,t)) = 8(s, t) 


for 0 < s,t < 1. This map satisfies u(s,0) = vo and u(s,1) =v; for all s 
and, moreover, the curve t +> u(1,t) must be constant. Hence vp = v1. This 
proves Step 5 and Lemma 7.2.3. 


Proof of Theorem 7.2.1. The set {p € Mla < inj(p;M) < b} = U, \ Ap 
is open for all nunbers 0 <a <6 < coo by Lemma 7.2.2 and Lemma 7.2.3. 
Hence the function M — (0, co] : p> inj(p; M) is continuous. 
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7.3. The Group of Isometries* 


This section is devoted to the Myers—Steenrod Theorem which asserts that 
the group Z(M) of isometries of a connected smooth Riemannian mani- 
fold M admits the natural structure of a finite-dimensional Lie group [52]. 


7.3.1 The Myers—Steenrod Theorem 


Assume throughout that M CR” is a nonempty connected smooth m- 
manifold. To state the main result, it is convenient to introduce the space 


pqe€M and ®:T7,M +>T,M 
is an orthogonal isomorphism 


G:= {(a®0) (7.3.1) 


This space is a groupoid, i.e. a category in which every morphism is an 
isomorphism. The space of objects is the manifold M, the morphisms 
from p € M to qg€ M are the triples of the form (q, ®,p) € G, the identity 
morphism from p to itself is the triple (p, 1, p), the inverse of (q, ®, p) € G is 
the triple (p, ®~!, q), and the composition of a morphism (q, ®, p) € G from p 
to q with a morphism (r, V,q) € G from q to r is the triple (r, V®, p). 

The space G is a smooth manifold (in the intrinsic sense). To see this, 
consider the diagonal action of the orthogonal group O(m) on the prod- 
uct of the orthonormal frame bundle O(M) with itself (Definition 3.4.3). 
This action is free and the map 7: O(M) x O(M) > G which sends the 
pair ((q,e’),(p,e)) to the triple (q,e’oe~',p) € G descends to a bijection 
from the quotient (O(M) x O(M))/O(m) to G. By Theorem 2.9.14 there is 
a unique smooth structure on G such that the map 7: O(M) x O(M) + G 
is a submersion. With this structure the maps s,t: G — M defined by 


s(q,®,p):=p, —-t(q, ®, p) = 
are smooth, the map e: M —> G defined by 
e(p) := (p, 1p) 
is an embedding, the inverse map 7: G — G defined by 
i(q, ®, p) := (p, ®", 4) 


is a smooth involution, the map s x t: G x G + M x M is a submersion, 
and the composition map m : (s x t)~'(A) > G defined by 


m((r, ©, q), (a, ®,p)) := (r, U®, p) 


is smooth. (Here A := {(q,q)|q € M} is the diagonal in M x M.) These 
properties assert that the tuple (G,s,t,e,7,m) is a smooth groupoid. 
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For each p € M define 


Gp = { (a8) 


This space is a submanifold of G because s: G — M is a submersion. The 
space G, is also a smooth submanifold of R” x £(T,M,R") and is dif- 
feomorphic to the orthonormal frame bundle O(M) Cc R” x R"*™ via the 
map G, + O(M): (q,®) + (¢q,®oe) for any orthonormal frame e of T,M. 
Since M is connected, Lemma 5.1.10 asserts that the map 


(7322) 


is an orthogonal isomorphism 


q¢€ M and ®:7,M > T,M } 


lp: T(M) Gp, (6) == (0(p), dd(p)), (7.3.3) 
is injective for every p € M. Denote the image of this map by 
Tp = tp(Z(M)) = {(6(p), do(p)) |6 € Z(M)} C Gp. (7.3.4) 


In the following theorem we do not assume that M is complete. For a space 
of smooth functions or maps on M we use the term C™ topology to mean 
the topology of uniform convergence with all derivatives on each compact 
subset of M. Likewise we use the term C° topology to mean the topology of 
uniform convergence on each compact subset of M. The latter is also called 
the compact-open topology (because a basis of the topology consists of 
sets of maps, one for each compact subset of the source and each open subset 
of the target, that send the given compact subset of the source into the given 
open subset of the target). 


Theorem 7.3.1 (Myers—Steenrod). Let MCR” be a nonempty con- 
nected smooth m-manifold. Then the following holds. 


(i) There exists a unique smooth structure on the isometry group I(M) such 
that the map lp: I(M) > Gp in (7.3.3) is an embedding for every pe M. 


The topology induced by this smooth structure agrees with the C° topology 
and with the C® topology on I(M) and dim(Z(M)) < m(m + 1)/2. 
(ii) With the smooth structure in part (i) the maps 
T(M) x T(M) > T(M) : (, 6) 4 poe 
and 
T(M) 3 I(M):¢4¢1 
are smooth. Thus I(M) is a finite-dimensional Lie group. 


(iii) The Lie algebra of I(M) is the space VectK<(M) of complete Killing 
vector fields (defined below). 


Proof. See page 356. 
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7.3.2 The Topology on the Space of Isometries 


The next lemma shows that for each p € M the set Z, in (7.3.4) is a closed 
subset of G, and that the map tp : Z(M) — TZ, in (7.3.3) is a homeomorphism 
with respect to the C™ topology on Z(M). 


Lemma 7.3.2. Fix two elements po,qo € M, let ®o : Tp, M — Ty,M be an 
orthogonal isomorphism, and let d;: M — M be a sequence of isometries. 
Then the following are equivalent. 


(i) The sequence tp) (¢i) € Ip) converges to the pair (qo, Po) € Gyo, 4-€. 
lim ¢;(p0) = 4, lim d¢j(po) = ®o. (7.3.5) 
1 0o ICO 


(ii) There exists an isometry 6: M — M such that $(po) = qo, d¢(po) = ®o, 
and @; converges to ¢ in the C™ topology. 


Proof. That (ii) implies (i) follows directly from the definitions. We prove in 
three steps that (i) implies (ii). For p € M and r > 0 denote by V, C T,M 
the domain of the exponential map of M at p (Definition 4.3.5), and de- 
fine U;(p) := {q € M|d(p,q) < r} and B,(p) := {v € T,M | |v| <r}. 


Step 1. Assume (7.3.5) holds and choose a real number 0 < r < inj(po; M). 


Then r < inj(qo;M) and ¢; converges on the open set U;(po) in the C® 
topology to the isometry $9 := exPg, 2 Bo © eXPp,_ : U;(po) + Ur(qo).- 


Since r < inj(po; M), Corollary 5.3.3 asserts that 
Br(Gi(po)) = 4¢i(Po)Br(po) C 4bi(Po)Vpo © Voi (po) 
and $j © €XPp, = EXP}; (pp) 0 Pi : Br(po) + U-(di(po)). Thus the map 
©<P oi(p0) = pi © EXPpo © o;' : B,($i(po)) = U;(¢i(po)) 


is a diffeomorphism and so r < inj(¢;(po); M) for each 1 € N. Since ¢;(po) 
converges to qo, this implies r < inj(qo; WZ) (Lemma 7.2.3). Hence the se- 


quence of maps exp, (p)) 0; converges to the map exp,, 0° ®o in the C™ 
topology on B,(po9). Hence the sequence of isometries 


Pi = EXP}, (pg) 0 Bi © eXPp,. : U,(po) + M 
converges in the C™ topology to the diffeomorphism 


go := €XPgqo © Pp 0 expy,, : U;,(po) =? U;(qo): 
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The map ¢o satisfies 
d(Go(P), G0(4)) = Jim d(@i(p), Gi(q)) = d(v, @) 


for all p,q € U;(po) and hence is an isometry (Theorem 5.1.1). This proves 
Step 1. 


Step 2. The sequence ¢; converges in the C® topology on all of M to a 
smooth map @6:M > M. 


Define the set Mo := {p € M | the sequence (¢;(p), d¢i(p)) converges}. This 
set is nonempty because po € Mp. We prove that Mo is open. Fix any 
element p € Mo and define q := limj_,~ ¢;(p) and ® := lim; 4. d¢;(p). 
Choose a real number r such that 0 < r < inj(p;M). Then Step 1 as- 
serts that r < inj(q;M) and the sequence @;|y,(p) converges to the diffeo- 
morphism exp, 0 ®o exp, | : U;(p) > U;(q) in the C® topology on U;(p), 
and hence U;(p) C Mo. This shows that Mo is open, that |, converges 
in the C™-topology to a smooth map ¢@: Mo > M, and that ¢(Mp) is an 
open subset of M. 

We prove that Mo is closed. Let p, € Mo be a sequence that con- 
verges to an element p € M. Then there exists a real number r > 0 such 
that r < inj(p,;M) for all v. Hence B,(p,) C Mo for all v by Step 1. 
Choose v so large that d(p,,p) < r to obtain p € Mo. This shows that Mp is 
closed. Since M is connected, we deduce that Mp = M. This proves Step 2. 


Step 3. The map 6: M > M in Step 2 is an isometry. 
We claim that the triple po, qo, ®o in (7.3.5) satisfies 


lim $7" (qo) = po, lim dd; '(qo) = 8". (7.3.6) 
i-0o 100 


Namely, the sequence d(¢; (40), po) = d(qo, ¢i(po)) converges to zero by as- 
sumption, and by Step 1 the derivatives dé; converge to d@ uniformly in 
some neighborhood Up C M of po. Since ¢;, (qo) € Up for 7 sufficiently 
large, this implies that the sequence d¢;(¢; 1(qo)) converges to d@(po) = ®o 
and hence the sequence d¢;(¢; (gg)? = dd; ‘(qo) converges to Dp. This 
proves (7.3.6). By (7.3.6) and Step 2 the sequence ¢; ' converges in the C™ 
topology to a smooth map w:M —> M. By uniform convergence on com- 
pact subsets of M we have wog=id and doy =id, sod:M> Misa 
diffeomorphism. Moreover, since ¢; is an isometry for each 7 and converges 
pointwise to ¢, we have d(¢(p), (¢)) = limi+co d(¢i(p), d:(q)) = d(p, q) for 
all p,q € M, so ¢ is an isometry. This proves Step 3 and Lemma 7.3.2. 
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The next goal is to verify that the spaces TZ, in (7.3.4) are diffeomorphic 
to each other. For po,pi € M define the map Fp, ») : Zp) — Lp, by 


Fp1,po(P(Po), 4b(p0)) = (P(p1),db(p1)), = 6 E T(M). (7.3.7) 


This map is well-defined by Lemma 5.1.10, because M is connected. Col- 
lectively, these maps give rise toa map F: M x T > ZT defined by 


L := {(¢(p), d¢(p),p) |p € M, 6E T(M)} CG, 


F (pi; ($(Po),; d6(p0), Po)) = (b(p1), do(p1), P1) 


for po,p1 € M and ¢ € T(M). The next lemma uses the concept of a smooth 
map on an arbitrary subset of Euclidean space as in the beginning of §2.1. 


(7.3.8) 


Lemma 7.3.3. The map F: M x I > T is smooth. In particular, for each 
pair of points po,pi1 € M, the map Fp,» : Zp) — Lp, is a diffeomorphism. 
Proof. Let po,pi € M and choose a smooth path y: J = [0,1] ~ M with 
the endpoints 7(0) = po and y(1)=pi. Let Uy CG, be the set of all 
pairs (qo, ®o) € Gp) such that there exists a development (®,7, 7’) on the 
interval J that satisfies 

¥(0)=q,  (t) =o. (7.3.9) 
By Remark 3.5.22, the set U/, is open in G,, and the map U, — G that sends 
the pair (qo, ®o) € Uy to the pair (7’(1), ®(1)) € Gp, is smooth. This shows 
that there is a unique diffeomorphism 

Fitba, ORV l=, UyCGy, Ua c Gp, 


that satisfies the condition 


Fy (7'(0), ®(0)) = (7/1), (4) (7.3.10) 
for every development (®,y,7) of M along M on the interval J. The in- 
verse of F, is the smooth map F,-1:U,-1 +U,. If ¢e T(M), then by 
Lemma 6.1.11 there exists a development (®,7, 7) on I satisfying the ini- 
tial conditions 7/(0) = 6(po) and ®(0) = d¢(po), and it is given by 


V(H) =o), — ®) = dg((4)) (7.3.11) 
for t € I. Hence (@(po), d¢(po)) € Uy and by (7.3.10) and (7.3.11) we have 


F(6(po), 46(po)) = (¢(P1), 46(P1)) = Fr. ((P0), 4(P0)) 


for every 6 € Z(M). Thus Z,, CU, and Zp, CU,-1 and Fy|Zno a ee 
Hence Fp, ») is smooth. The smoothness of F follows from the smooth 
dependence of the map F, on the curve y, the verification of which we leave 
to the reader. This proves Lemma 7.3.3. 
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7.3.3 Killing Vector Fields 


Killing vector fields are defined as those vector fields on M whose flows 
are one-parameter families of isometries. Assume throughout that M is a 
nonempty connected smooth m-dimensional submanifold of R”. Let X be 
a vector field on M and denote by 


RxM>D>M:(t,p) 4 d(t,p) = ¢'(p) 


the flow of X (Definition 2.4.8). Then Theorem 2.4.9 asserts that D is 
an open subset of Rx M and @¢ is smooth. Thus, for every t € R, the 
set D; := {p € M | (t,p) € D} is open in M and the map ¢!: Di > D_; isa 
diffeomorphism with the inverse ¢~¢ : D_; > D,. 


Lemma 7.3.4. In this situation the following are equivalent. 
(i) For every t € R the diffeomorphism ¢* : Dy + D_, is an isometry. 


(ii) For every p € M and every pair of tangent vectors u,v € T,M, we have 
(VuX(p),v) + (u, VoX(p)) = 0. (7.3.12) 
Proof. Let p¢€ M and v € T,M. Choose a smooth curve a: R— M such 
that a(0) = p and a(0) = v, let Q := {(s,t) € R?| (s,a(t)) € D}, and define 
the map 7:2 — M by 7(s,t) := ¢'(a(s)) for (s,t) € Q. Then 
Osy(0, t) = dg‘ (p)v, vy =Xo of VsOr7Y = VosyX (4). 
Thus the formula V,0,y = V;0¢7 in Lemma 3.2.4 shows that 
Vdd" (p)v = Vasey X ($"(0)) (7.3.13) 


for all ¢ € R and this implies 


i |a"(p)o|” = 2 (Vass ryX ("(P)), 4b"(p)v) « (7.3.14) 


The right hand side vanishes for all p, v,¢ if and only if X satisfies (i), and the 
left hand side vanishes for all p,v,t if and only if the flow of X satisfies (ii). 


This proves Lemma 7.3.4. 


Definition 7.3.5. A vector field X € Vect(M) is called a Killing vector 
field (named after Wilhelm Karl Joseph Killing [39]) iff it satisfies equa- 
tion (7.3.12) for allp € M and allu,v €T,M. The space of Killing vector 
fields will be denoted by Vectk(M). 
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The next lemma shows that the space of Killing vector fields is a finite- 
dimensional Lie subalgebra of the Lie algebra Vect (MV) of vector fields on M. 
Part (ii) is the linear counterpart of Lemma 5.1.10. 


Lemma 7.3.6. Let M C R” be a connected smooth m-manifold. Then the 
following holds. 

(i) If X is a Killing vector field on M andy: M — M is an isometry, then 
the pullback w*X is a Killing vector field. If X and Y are Killing vector 
fields on M, then so is their Lie bracket [X,Y]. 

(ii) If X is a Killing vector field on M and there exists a po € M such that 


X(po)=0,  VX(po) = 9, (7.3.15) 


then X(p) =0 for allpe M. 
(iii) If M is complete and X is a Killing vector field, then X is complete. 


Proof. We prove part (i). If 6! : Dy + D_ is the flow of X, then 
plodop:p (Di) +o '(D4) 


is the flow of ~*X. Hence the flow of w*X is a one-parameter family of 
isometries by Lemma 7.3.4 and so, again by Lemma 7.3.4, ¢*X is a Killing 
vector field. Now assume that both X and Y are Killing vector fields and 
denote by ~' the flow of Y. Then each y” is an isometry by Lemma 7.3.4 
and [X,Y] = an (W)*X by Lemma 2.4.18. Hence the pullback (q)*X 
is a Killing vector field for each t by what we have already proved, and so 
is their limit as t tends to zero. This proves (i). 

We prove part (ii). Let 6’: D; > D_; be the flow of X. Then it follows 
from equations (7.3.13) and (7.3.15) that ¢'(po) = po and d¢‘(po) = id for 
all t. By Lemma 5.1.10 this implies that the isometry ¢; :D, > D_; is the 
identity for every t€ R. Thus D; = D_; = M for all t and X(p) = 0 for 
all p € M. This proves (ii). 

We prove part (iii). Thus assume that M is complete and X is a 
Killing vector field. Let y : I — M be an integral curve of X on its 
maximal existence interval J C R and denote p := y(0). Differentiate the 
equation ¥(t) = X(7(t)) to obtain Sf H(O)/? = 2(V54)X (y(#)), V(t)) = 0 for 
all t € I. Hence the function J > R:t+> |4(t)| is constant and this implies 
the inequality d(p,7(t)) < fs) ds = t|X(p)| for allt € J. Since M is 
complete, the closed ball of radius R about p is compact for every R > 0 
(Theorem 4.6.5). Hence the restriction of 7 to any bounded subinterval 
of I is contained in a compact subset of M amd by Corollary 2.4.15 this 
implies J = R. This proves (iii) and Lemma 7.3.6. 
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The proof of Theorem 7.3.1 is somewhat analogous to the proof of the 
Closed Subgroup Theorem 2.5.26. The first parallel is in part (i) of the next 
lemma, which asserts that the space of complete Killing vector fields is a Lie 
subalgebra of Vect(J/) and can be viewed as an anlogue of Lemma 2.5.28. 
Part (ii) is the analogue of Lemma 2.5.27. 


Lemma 7.3.7. (i) The set Vectk,-<(M) of complete Killing vector fields 
on M is a Lie subalgebra of Vecth(M). 


(ii) Let Rx M > M : (t,p) Yi (p) be a smooth map such that yy is an 
isometry for every t € R and define X (p) := Flo Ur(p) forp € M. Then X 
is a complete Killing vector field. 


Proof. The proof has three steps. 


Step 1. Let U,V C M be nonempty open sets, let db: U > V be an isometry, 
and let dé, : M > M be a sequence of isometries that converges uniformly 
on every compact subset of U to @. Then ¢ extends uniquely to an isometry 
from M to M and $x converges in the C™® topology to this extension. 


Fix an element po € U. Then by part (i) of Exercise 5.1.11 we have 
jim, x(Po) = o(p0), lim d¢j,(po) = 4¢(po). 
— 00 k-> 00 


Hence the assertion of Step 1 follows from Lemma 7.3.2. 
Step 2. We prove part (ii). 


That X is a Killing vector field follows from the same argument as in 
Lemma 7.3.4 with time dependent vector fields. Denote by ¢! : D; > D_4 
the flow of X. Then the sequence of isometries bee : M— M converges 


to ¢ uniformly on every compact subset of D; (see Exercise 7.3.10 below). 
Hence Step 1 asserts that ¢' extends to an isometry on all of M when- 
ever D; is nonempty, in particular for small t. The extended isometries still 
satisfy 6°** = $8 o ¢' and Od; = X od’. Thus D; = M for small t and so for 
all t, because 6 *(D_,M Dz) C D541 (Theorem 2.4.9). This proves Step 2. 


Step 3. We prove part (i). 


Let X,Y € Vectx,-(M), let ¢' be the flow of X, and let 7)" be the flow of Y. 
Then an ¢' oy'(p) = X(p)+ Y(p) for all p € M, and so X + Y is a com- 
plete Killing vector field by Step 2. Hence Vect «,-(/) is a vector space. It is 
finite-dimensional by Lemma 7.3.6. Moreover, (W~'0¢% 0") cp is the flow of 
the pullback vector field (w')*X, hence (7")*X is a complete Killing vector 
field for each t € R, and hence so is the Lie bracket [X,Y] = Flexo (pty*X 
by finite-dimensionality. This proves Step 3 and Lemma 7.3.7. 
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Exercise 7.3.8. Let MC R” be a smooth manifold and let X be a vector 
field on M. Consider the following condition. 


(K) Ify: I — M is a geodesic, then X oy € Vect(y) is a Jacobi field along y, 
i.e. it satisfies the differential equation VV(X oy) + R(X 07,4)7 = 0. 


Prove that every Killing vector field satisfies (K) and use this to give an 
alternative proof of part (ii) of Lemma 7.3.6. If M is compact, prove that 
a vector field satisfies (K) if and only if it is a Killing vector field. Find a 
vector field on a noncompact manifold that satisfies (K) but is not a Killing 
vector field. Hint: Differentiate the function t > (X(y(t)), y(t)) twice. 


Exercise 7.3.9 (Gr6énwall’s inequality). [f a,c > 0 ando: [0,T] > R is 
a continuous function satisfying 
t 
0<o(t)<at cf a(s) ds jorOQataT, (7.3.16) 
0 


then a(t) < ae™ for every real number 0 <t <T. Hint: The function 


t 
Fi) = | (ae® — o(s)) ds 
0 
satisfies 7(t) > cr(t) for all t. Differentiate the function t 4 e~“r(t) to show 
that 7 is nonnegative and nondecreasing. 


Exercise 7.3.10. Let T,c,e¢ be positive real numbers, let M Cc R” bea 
smooth m-dimensional submanifold, and let [0,T|x M — R” : (t,p) H X¢(p) 
and [0,7] x M — R”: (t,p) 4 Yi(p) be continuous maps that satisfy the 
conditions X;(p), Y:(p) € TpM and 


Xi(p) —Yi(p)|pn Se, | Xt(p) — Xt (Q) lpn < clp — apn (73:17) 


for all p,q € M and allt € [0,7]. Ifthe curves 3, : [0,7] — M are solutions 
of the differential equations 3(t) = X;(G(t)) and ¥(t) = ¥:(y(t)), prove that 


|B(4) — ¥(¢)le» < (180) - (0) ew + Te) e% (7.3.18) 


for 0 <t< TJ. Relax the continuity requirement in ¢ on the vector fields to 
piecewise continuity. Hint: Define the function o : [0,7] — R by 


a(t) = |B(t) — 7(t)IRe- 
Show that ; 
o(t) <o(0)+te+ ef a(s) ds 
0 


and use Grénwall’s inequality. 
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7.3.4 Proof of the Myers—Steenrod Theorem 


With these preparations we are ready to prove Theorem 7.3.1. The following 
lemma is the heart of the proof. It is the analogue of Lemma 2.5.29 in the 
proof of the Closed Subgroup Theorem 2.5.26. 


Lemma 7.3.11. Fiz an element po € M, a tangent vector vp € T,,M, and a 
linear map Ao : Ty, M — Tp,M. Let ¢; € I(M) be a sequence of isometries 
and let 7; be a sequence of positive real numbers such that 


lim 4i(p0) = po, _ lim d¢i(po) = 1, (7.3.19) 
i-0o 100 
and, for allv € T,,M, 
on i(Po) = Po _ lim doi(po)v — v 


41-00 Ti 1-00 Ti 


Then there exists a unique complete Killing vector field X on M such that 


X (po) = V0; dX (po) = Apo. (7.3.21) 
For every p€ M and every v € T,M this vector field satisfies 
i(p) — dd; (p)v — 
fi EP yi EY a. (7.3.22) 
400 Ti 4-00 Ti 


and the convergence is is uniform on compact subsets of TM. 
Proof. By (7.3.19), (7.3.20), and Exercise 2.2.4 we have 
(Po, Ao) € T(po,1)Gpo- 


Let p € M and recall the definition of the map Fp», : Zp) > Zp in (7.3.7). 
By Lemma 7.3.3 this map extends to a diffeomorphism F, from an open sub- 
set Uy C Gp, containing the set Z,, to an open subset U/,-1 C Gp, containing 
the set Z, and it satisfies F(po, 1) = (p, l). Define 


(X(p), A(p)) = dF, (po, 1)(v0, Ao) € T(p,1)Gp- 


Since F,(¢;(po), ddi(po)) = (di(p), dd; (p)) for all i, it follows from (7.3.20) 
and Exercise 2.2.16 that, for all v € T,M, we have 


lim $ilp) — P =X (p); lim coe = A(p)v. (22a) 
Io Ti 1 00 Ti 
This formula shows that the pair (X(p), A(p)) € T(p,1)Gp is independent of 
the choice of the extension F,, used to define it. Moreover, it follows from the 
smoothness of the map F in Lemma 7.3.3 that the map p> (X(p), A(p)) 
is smooth and that the convergence in (7.3.23) is uniform as the pair (p, v) 


varies over any compact subset of 7’M. 
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Now let v € T,M and define +(t) := exp,(tv) for t € Ip». Then 


X(4(t)) — X(p) = Jim SOO) = WO _ 5 Pl) —P 


1-400 Ti i oo Ti 


Pil) — Gi((0)) — VE) + (0) 


= lim 


I 
5 


I| 
2 
is) 
Fie 
w 
z= 
= 
Bs 
Q 
W 


0 


Here the last step uses uniform convergence on compact sets in (7.3.23). 
Divide by t and use the continuity of the curve t + A(7(t))+(t) to obtain 
' ; _ X(exp,(tv)) — X(p) 


ail 
A(p)v = lim = ; A(7(s))¥(s) ds = lim ; 


Hence, by Exercise 2.2.16, we deduce that, for all p € M and all v € T7,M, 
A(p)v = dX(p)v. 


By (7.3.20) and (7.3.23) this shows that X satisfies (7.3.21) and (7.3.22). 
By (7.3.22) and Exercise 2.2.4 we have (X(p),dX(p)) € Tip1)Gp and this 
implies (v, dX (p)v) = 0 for all v € T,,M. Here is a more direct proof of this 
crucial fact. Namely, by (7.3.22) we have 


. d¢;(p)u — v|? 
jim weit e = |dX (p)o|’. 


Hence 


(v,dX(p)v) = lim (uv, ddi(p)u — v) 


1-00 Ty 
. = 2 
— tim (2. 46e(0)e) = Io 
100 Ti 
: — 2 . 2 
— tin 22ddil)w) ~ lv? = |adi()o| 
i-00 27; 
— yl2 7. 
= — jim eito)v = ¥P 
1-00 Te; 2 
=0 


for all p € M and all v€ T,M. This shows that X is a Killing vector field 
and so it remains to prove that X is complete. 
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Let ¢! : Dy + D_, be the flow of X and, for p € M, denote by 
Ip := {t ER|pe Dy} 
the maximal existence interval for the solution y of the initial value prob- 


lem ¥(t) = X(y(t)), y(0) = p. We prove in three steps that X is complete. 
Step 1. Let p € M and let T > 0 such that T|X(p)| < inj(p;M). Then 


re i, 


Define 7(t) := $'(p) for t € Ip. Since X is a Killing vector field, the diffeo- 
morphism ¢; : Dy > D_; is an isometry by Lemma 7.3.4, and hence 


LW) = |X (4"())| = ld" (p) X (p)| = |X (0), 


for all t € I,. This implies d(p, y(t)) < t|X(p)| for all t € I,, and hence + 
cannot leave the compact set Up)x(p)\(p) = {¢ € M|d(p,q) < T|X(p)|} on 
any subinterval I C [-T,T]. Hence [-T,T] C I, by Corollary 2.4.15 and 
this proves Step 1. 


Step 2. Letpe M. Ift ER satisfies |tX(p)| < inj(p; M) and the sequence 
of integers m; € Z is chosen such that 


Mii <t< (mi + Lie. (7.3.24) 


then $*(p) = lim;_+o0 ¢;"* (p). 


Let T > 0 such that T|X(p)| < inj(p; MM). Then [0,T] Cc I, by Step 1 and 
the set 
K = {¢'(p)|O<t<T} 


is compact and |X(q)| = |X(p)| for all g€ K. By Lemma 4.2.7 there exists 
a constant 6 > 0 such that, for all g,go,q. € M, 


qe kK, d(q,q) <5, dqau)<sd = dod) <9 (7.3.25) 


ldo — qi 


Fix a real number 0 < € < 1 and choose ig € N such that, for all 7 > io, 
Ti|X(p)| <4, (T+ 7%)|X(p)| < inj(p; M), uD d(q, Oi(q)) < 6, (7.3.26) 
qe 


P(g) -—9 © 
Ti 


X(q)|<e. (7.3.27) 


sup ia a x(a) <€, sup 
L qek 
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Then, for all 1 >%o and all ge K, we have d(q,¢%(q)) < 7i|X(p)| < 6 
and d(q, ¢i(q)) < 6 by (7.3.26), and hence by (7.3.25) and (7.3.27), 


sup d(¢i(q), 9" (@)) < 2 sup |¢i(q) — 6" (9)| < Ate. (7.3.28) 
qek qek 


Now choose a real number 0 <¢<T and choose m; € No as in (7.3.24). 
Then, for k = 1,...,m;—1 we have ¢*"(p) € K and hence 


a(ot*(p), ot" (p)) 
< d(4i(##(p)), 6:(4"*(p))) + d(4i(o"*(p)), 8" (6**())) 
= d( oh (p), 6 (p)) + d(d:(4'™()), 6” (0"*(p)) ) 
< a(ot (p), 0°" (»)) + Ane. 
Here the last inequality holds for i > ig by (7.3.28). By induction this implies 
a6?" (p),8™"(p)) < Amyre < ATs. 


for all 1 > ig. Hence 
lim ¢;"'(p) = lim ¢""(p) = 9"(p) 
Ion ICO 
and this proves Step 2 for t > 0. For t < 0 the argument is similar. 


Step 3. J, =R for everype M. 


Fix an element p € M and real number T > 0 such that T|X(p)| < inj(p; M). 
Choose the sequence m; € No such that tm; < T < (m; + 1)7;. Then Step 2 


asserts that 67 (p) = lim; +. ¢/"'(p). Since ¢/" : M + M is an isometry, it 
follows that T |X (p)| < inj(¢;"(p); M) for all i € N and hence 


T |X (p)| < inj($" (p); M). 
By Step 1 this implies [0,T] C Igri). Hence [0,2T] C I, and, by Step 2, 

§'(p) = lim 97" (07 (P)). 
Continue by induction to obtain for all k € N that 

T |X (p)| < inj(¢"" (p); M) 
and hence [0, (k + 1)T] C I, and 

p07 (p) = lim $7" ($*"(p)). 
1 CO 


Thus [0, 00) C J, and the same argument shows that (—0oo, 0] C Ip. Hence X 
is complete. This proves Step 3 and Lemma 7.3.11. 
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The next lemma establishes the smooth structure on Z(M), in analogy 
to the proof of the Closed Subgroup Theorem 2.5.26. 


Lemma 7.3.12. For every p € M the set I, is a smooth submanifold of Gp 
and its tangent space at (q,®) € Z, is given by 


T(q,)Zp = {(X(q), dX (q)®) | X € Vectx,-(M)}. (7.3.29) 


Proof. By Lemma 3.4.5 with R™ replaced by T,M the tangent space of Gp 


at an element (q, ®) € G, is given by 


T(q,6)Gp = § (@ 9) (1- 1(q))@ =h,(q)®, and . (7.3.30) 
( aa 


du, bv) + (bu, dv) =0V u,v € TM. 


If (¢,®@) € G, and X € Vect(M), then (1 — II(q))dX(q)® = hg(X(q))® by 
the Gauf8—-Weingarten formula in Remark 5.2.5. Moreover, if X is a Killing 
vector field, then (@u, dX (q)@v) + (dX (q)®u, @v) = 0 for all u,v € T,M, so 
it follows from (7.3.30) that the pair (q, ®) with ¢ = X(q) and d= dX (q)® 
is a tangent vector of G, at (q,®). 


Now abbreviate 


k := dim(Vectx,-(M)) < mn +1) 


Fix an isometry ¢9 : M — M and define 
(qo, ®o) = (¢0(p), dbo(p)) = 4p(b0) € Zp. 


We must construct a coordinate chart on G, in a neighborhood of the 
point (qo, ®o) € Zp with values in an open set QC R° that sends T, to the 
intersection of Q with R* x {0}. For this we first choose a basis Yj,..., Y% 
of Vect «,¢(M) and then a basis 7 = (G1, ®1),.--, me = (@, &e) of the tangent 
space Tq, 69)Gp such that 


= dim(Gp) =: & 


ny = (G, %;) = (¥5 (¢0(P)). €¥5 (0(P))d¢o(p)). (een 
Next we choose any smooth map 
Re Ge) Pe) 


such that 1(0) = (qo, ®o) and 
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For a complete Killing vector field X € Vectx.¢(M) let yx € Z(M) be the 

time-1 map of the flow of X and define the map 0: R’ > Gy by 
o(t!,...,t°) = (dx(@),dvx(a)®), 

PY; tee y,. (7.3.31) 

(q, ®) := ACs ore ye) 


p< 
ll 


O(0) = (0) = (go, Bo) = tp(bo) € Zp 
and 
dO (0) (t', Pose i) = tim + Parery + tng 


for (t!,...,t°) € R®. Thus the derivative dO(0) : R’ > T(q,6o)Gp i8 a bi- 
jective linear map and so the Inverse Function Theorem asserts that the 
map O restricts to a diffeomorphism from a sufficiently small open neigh- 
borhood 2 ¢ R* of the origin onto its image (Q) C Gp, which is an open 
neighborhood in Gp of the point O(0) = tp(¢0) € Zp. With this understood, 
the assertion that Z, is a smooth submanifold of G, is a direct consequence 
of the following Claim. 


Claim. There exists an open set Qo C R® such that 
0EN CQ, Q(N0 M1 (R* x {O})) =UNLp, — Uo := O(Mo). (7.3.32) 


Suppose, by contradiction, that such an open set Qo does not exist. Then 
there exist sequences t; = ( aoe pty) € R¢ and ¢; € Z(M) such that 


dimt=0, 4 €2\(R*x {0}), Ot) = H(¢i) € Z. 
Define 
Mest He Gis OP) Se oe) 
Then (x, (qi), dvx, (qi) i) = O(ti) = tp(¢i) = (¢i(p), dgi(p)) and hence 
gi= (Wx! odi)(p), Bi = dbx, (qi) 'ddi(p) = d(Wy! © 4:)(p). 


Thus (Wx od) = (qj, ®;) = 0(0, fak Oe ae i) is still a sequence in 
the set O(Q \ (R* x {0})) NZ, that converges to tp(¢o). Thus we may as- 
sume without loss of generality that t! = ¢? =--- =t* = 0 and so 


U(ti) = tp(i), im tp) = (40). (7.3.33) 
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Then ¢; 9 dp | converges to the identity in the C® topology by Lemma 7.3.2. 
Since ¢; # ¢o0, we have 


74 := |(610¢9')(p) —p| + sup 
OAvETpM |r| 


for all i by Lemma 5.1.10, and lim;_,., 7% = 0. Hence, by Exercise 2.2.4 there 


exists a tangent vector (vo, Ao) € T(p.1)Gp and a subsequence, still denoted 
by ¢;, such that, for all v € T,M, 


i OO ties 
100 Ti too Ti 


It follows from (7.3.35) and Lemma 7.3.11 that there exists a complete 
Killing vector field _X € Vectx,-(M) such that 


X(p)=v,  dX(p) = Ao. 


This vector field is nonzero because (vp, Ag) 4 0 by (7.3.34). Moreover, by 
equation (7.3.22) in Lemma 7.3.11, with ¢; replaced by ¢; 0 bo - and the 


pair (p,v) replaced by the pair (¢o(p), déo(p)v), we obtain 


X(do(p)) = lim Li?) = Pole) 


1-00 Ti 


dX ($0(p))ddo(p)v = lim dgi(p)v — dgo(p)v 


oe) Ti 


for v € T,M. Since by (7.3.33) the sequence 
(Gi(p), di(p)) = tp(Gi) = e(ta) 


converges to (¢9(p),d¢o(p)) = 4(0), the pair (X(¢0(p)), 4X (¢0(P))d¢go(p)) 
belongs to the image of the derivative di(0) by Exercise 2.2.4, and hence 
is a nonzero linear combination of the vectors ,41,... wm. It is also a 
linear combination of the vectors 71,...,7,, because X is a complete Killing 
vector field and so is a linear combination of the vector fields Yj,...,Y¥z. 
This is a contradiction, and this contradiction proves the Claim. Thus there 
does, after all, exist an open set Qo C R° that satisfies (7.3.32), and the 
map O07! : Up — Qo is then a coordinate chart on G, which satisfies 


O71(Up N Tp) = M9 M (R* x {0}). 


Hence Z, is a submanifold of G, and this proves Lemma 7.3.12. 
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Proof of Theorem 7.3.1. By Lemma 7.3.3 and Lemma 7.3.12 there exists 
a unique smooth structure on Z(M) such that the map tp, : Z(\M) > Gp 
in (7.3.3) is an embedding for every p€ M. That the topology induced 
by this smooth structure agrees with the C™® topology was established in 
Lemma 7.3.2. That it also agrees with the compact open topology (i.e. with 
the C° topology of uniform convergence on conpact sets) is the content of 
Exercise 5.1.11. This proves part (i). 
We prove part (ii). Recall the definition of the map 


F:MxI-T 


in (7.3.8) and the target map t: G + M, the inverse map i: G > G, and the 


composition map m: (s x t)~'(A) > G in the beginning of §7.3.1. These 
maps are all smooth and turn Z into a smooth subgroupoid of G. For 
each p € M they endow the submanifold Z, C G, with the structure of a Lie 
group as follows. The unit is the element 


€p := (p, 1) € Z (7.3.36) 


The product is the map mp : Zp x Zp —> Zp defined by 


(mp(n, €),p) := m( F(t(é.P), (n,p)), (<,r)) (7.3.37) 


for €,7 € Z,, and the inverse map is the involution 7, : Z, — Z, defined by 


(ip(€),p) -= F(p, i(€,p)) (7.3.38) 
for € € Z,. The maps mp, in (7.3.37) and i, in (7.3.38) are smooth by 


definition. To show that they define a group structure and that the map 


tp: T(M) > T 


in (7.3.3) is a group isomorphism, we must verify the identities 


inlid) = ey, 
Mp(tp(), ep(G)) = tpl © 4), (7.3.39) 
ip(tp($)) = p($") 


for all 6,W € Z(p). The first equation in (7.3.39) follows directly from the 
definitions. To prove the remaining equations, fix two isometries ¢ and w 
and define €,7,¢ € Z, by 


€ := (9), 1) = by(W), C = Up(p og). 
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Then, by the chain rule, 


(¢,p) = (v(o@)), 44(4(p))46(P), P) 


I 
= 
a 
— 
= 
= 
= 
Es 
= 
= 
= 
pS 
sS 
= 
~~ 
a 
— 
~ 
= 
= 
3 
ma 
nn" 


= m(F(6(p), (W(P), @(P),P)), (6), €6(P),P)) 
= m( F(t(é,p), (n,p)), (EP 
= (mp(n, €), Pp): 


Here the last equality follows from the definition of the map mp in (7.3.37). 
This proves the second equation in (7.3.39). 


To verify the third equation in (7.3.39), we compute 


(tp(6'),p) = (6 '(p), dd" (p), v) 


Here the last equality follows from the definition of the map i, in (7.3.38). 
This proves the third equation in (7.3.39) and part (ii). 


That the Lie algebra of Z, is isomorphic to the space Vectx,-(M) of 
complete Killing vector fields under the Lie algebra isomorphism 


Vect K,<(M) = TepLp eS (X(p), dX (p)) 


follows from Lemma 7.3.12. This proves part (iii) and Theorem 7.3.1. 


Corollary 7.3.13. Let (p,e) € O(M). Then there exists a constant ¢ > 0 
with the following significance. If 6: M > M is an isometry that satisfies 


d(p, 0(p)) <€, le ~ db(p)e|anxm <6, 


then there exists a complete Killing vector field X € Vect.<(M) whose flow 
has the time-1-map wx = ¢. 


Proof. Use the construction of the map 0: 2 — G, and the Claim in the 
proof of Lemma 7.3.12 with ¢9 = id to obtain @ € O(N (R* x {0})). 
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7.3.5 Examples and Exercises 


Throughout we denote by Zp(M) C Z(M) the connected component of the 
identity in the group of isometries of a Riemannian manifold M. 


Example 7.3.14. The isometry group of R’” is the group of all affine trans- 
formations of R™ with orthogonal linear part (Exercise 5.1.4). 


Example 7.3.15. We have seen in Example 6.4.16 that the isometry group 
of the m-sphere S$” C R™*! is the group Z(S™) = O(m + 1) of orthog- 
onal transformations of the ambient space. In the case m = 2 a theorem 
of Smale [72] asserts that the inclusion O(3) = Z($?) < Diff(S?) of the 
isometry group of the 2-sphere into the infinite-dimensional group of all 
diffeomorphisms of the 2-sphere is a homotopy equivalence. 


Example 7.3.16. In 86.4.3 we have introduced the hyperbolic space H”™. Its 
isometry group is the group Z(H”) = O(m, 1) of all linear transformations 
of R™*! that preserve the quadratic form Q in (6.4.9) (Exercise 6.4.25). In 
the case m = 2 the identity component of this group is isomorphic to the Lie 
group PSL(2,C) = SL(2, C)/{+1}. Geometrically this can be understood by 


examining the upper half space model of H? (Exercise 6.4.27). 


The preceding three examples are the constant sectional curvature man- 
ifolds discussed in §6.2 and §6.4. In all three cases the isometry group has 
the maximal dimension dim(Z(M)) = m(m + 1)/2 and is diffeomorphic to 
the orthonormal frame bundle O(M), so these examples satisfy the con- 
dition Z, = G, in the notation of 87.3.1. By Corollary 6.4.13 a complete, 
connected, simply connected manifold M satisfies Z, = G, if and only if it 
has constant sectional curvature. 


Exercise 7.3.17. Consider the incomplete 2-manifolds 
Mo :=R*\ {(0,0)}, My := R°\Z?. 


Prove that for i = 0,1 every isometry of M; extends to an isometry of R? and 
so Z(M;) is a subgroup of the Lie group Z(R2) of affine maps with orthogonal 
linear part (Example 7.3.14). The isometry group of Mo is isomorphic to 
the Lie group O(2). The isometry group of M; is discrete and is an example 
of a so-called wallpaper group (of which there are 17). The Lie algebra 
of Killing vector fields in both cases has dimension 3. Which Killing vector 
fields are not complete? Hint: A Killing vector field on a connected open 
set M Cc R? is a smooth map M > R?: (2,y) + (u(z,y), u(x, y)) that 
satisfies the equations 0;u = Oyv = 0 and Oyu + 0;v = 0. Deduce that the 
map (u,v) is affine and has a skew-symmetric linear part. 
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Example 7.3.18. The identity component of the isometry group of the 
complex projective space CP” (Example 2.8.5) with the Fubini-Study metric 
(Example 3.7.5) is the group 


Io(CP”) = PSU(n + 1) = 8U(n + 1)/Z(SU(n +1) & U(n + 1)/U(1). 


Here Z(SU(n + 1)) = {AD|A € St, Ant = 1} & Z/(n + 1)Z is the center 
of the group SU(n + 1). We emphasize that the dimension n(n + 2) of 
the isometry group Z(CP”) is smaller than the dimension n(2n + 1) of the 
orthonormal frame bundle O(CP”) unless n = 0 or n = 1. 

In the case n = 1 the projective line CP! is isometric to the 2-sphere 
by stereographic projection (Exercise 2.8.13 and Example 3.7.5). Hence the 
identity component PSU(2) of the isometry group of CP! is isomorphic to 
the identity component SO(3) of the isometry group of $?. An explicit 
isomorphism is discussed in Exercise 2.5.21. 

The full isometry group Z(CP”) has two connected components. In the 
case n = 2 it is an open question whether the inclusion Z(CP?) — Diff(CP7?) 
of the isometry group of CP? into the group of all diffeomorphisms of CP? is 
a homotopy equivalence. By deep results of Gromov [21] and Taubes [74] a 
positive answer to this question is equivalent to the assertion that the space 
of symplectic forms on CP? in a fixed cohomology class is contractible (the 
symplectic uniqueness conjecture for CP”). It is not even known whether this 
space is connected or, equivalently, whether every diffeomorphism of CP? 
that induces the identity on cohomology is isotopic to the identity. For a 
more detailed discussion see [67, Example 3.4] and [49, Example 13.4.1]. 


Example 7.3.19. The identity component Zo(S? x $7) of the isometry group 
of the product manifold M = S? x $? is the product group SO(3) x SO(3). 
In contrast to Smales’ Theorem the inclusion Zo(S? x $7) — Diffo(.S? x S?) 
into the group of diffeomorphisms of S? x S? that are isotopic to the identity 
is not a homotopy equivalence. For example, if ¢,9 : S? + S? denotes the 
rotation about the axis through x € S? by the angle 6 € R/2xZ and the 
diffeomorphism w9 : $2 x S? — S? x S$? is defined by 


wo(@,y) = (x, bzo(y)), LYE ae 


then the loop R/27Z — Diffo($? x S?) : 6 +4 yp is not contractible and 
neither is any of its iterates. Thus Diffo(.$? x $?) has an infinite fundamental 
group while the fundamental group of SO(3) x SO(3) is finite. For S? x $? it 
is an open question whether every diffeomorphism that induces the identity 
on cohomology is isotopic to the identity. For a more detailed discussion 
see [67, Example 3.5] and [49, Example 13.4.2]. 
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7.4 Isometries of Compact Lie Groups* 


In the following theorem we denote by Zo(M) the connected component of 
the identity in the group of all isometries of a manifold M. 


Theorem 7.4.1. Let G C GL(n,R) be a compact connected Lie group 
equipped with a bi-invariant Riemannian metric. If 6 € Io(G), then there 
exist elements a,b € G such that 


6(g) = dap(g) = agb* for allg €G. (7.4.1) 


Proof. See page 364. 


The proof of Theorem 7.4.1 makes use of the Killing form introduced in 
Example 5.2.24. Recall the definition of the center Z(g) in Exercise 2.5.33 
and of the commutant [g,g] C g in Exercise 5.2.23. The heart of the proof 
is Lemma 7.4.3. The case where the Lie algebra has a nontrivial center is 
then dealt with in Lemma 7.4.5. 


Lemma 7.4.2. Let g be a finite-dimensional Lie algebra that admits an 
invariant inner product and has a trivial center. Then the Killing form on g 
is nondegenerate. 


Proof. Exercise 5.2.26. 


Lemma 7.4.3. Let g be a finite-dimensional Lie algebra and assume that 
the Killing form on g is nondegenerate. Then the following holds. 


(i) Z(g) = {0} and [g, 9] = g. 
(ii) The Lie algebra homomorphism ad : g — Der(g) is bijective. 
(iii) Every derivation 6 : g > g has trace zero. 


(iv) Let 6: g > g be a linear map such that, for all €,,¢ € g, 
O[[é, m], 6] = [16&, m], 6] + (1, dn], ¢] + [l,m], 5¢]. (7.4.2) 


Then there exists a unique element 5 € g such that 6€ = [&5,€] for all € € g. 


Proof. We prove part (i). If € € Z(g), then ad(€) = 0, hence «(€,7) = 0 for 
all 7 € g, and hence € = 0 by nondegeneracy of the Killing form. To prove 
that [g,g] = g, assume that A : g > R is any linear functional that vanishes 
on the commutant [g, g]. Since the Killing form is nondegenerate, there exists 
an element ¢ € g such that A = «(C,-). Hence 0 = «(¢, [€, n]) = «([C, €], 7) 
for all €£,7 € g. Since the Killing form is nondegenerate, this implies [¢, €] = 0 
for all € € g, hence ¢ € Z(g), hence ¢ = 0, and hence A = 0. This proves (i). 
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We prove part (ii). The map ad: g — Der(g) is injective by part (i). 
Choose a basis &,...,& of g. Since the Killing form is nondegenerate, 
there exists a dual basis 7,...,% of g such that «(&,1j;) = 6;; for all i, j. 
These bases satisfy ¢ = )0, K(m,¢)& for all ¢ € g. Now let 6 € Der(g) and 
define € := 0, trace(6d ad(&)) € g. Then, for all ¢ € g, we have 


trace(dad(¢)) = oy K(m, ¢)trace(d ad(€;)) = K(,¢) = trace(ad(€)ad(¢)). 
Thus the derivation ¢ := 6 — ad(€) satisfies trace(ead(¢)) = 0 for all ¢ € g. 
Since |e, ad(7)] = ad(en), this implies 


K(en, ¢) = trace(ad(en)ad(¢)) = trace({e, ad(7)Jad(¢)) 
— trace(e[ad(n), ad(¢)]) = trace(e ad([n, ¢])) =0 
for all 7,¢ € g. Hence ¢ =0 by nondegeneracy of the Killing form and 
hence 6 = ad(&). This proves (ii). 
We prove part (iii). Since trace(ad([&, 7])) = trace([ad(€), ad(7)]) = 0 for 
all €,7 € g and [g,g] = g by part (i), we have trace(ad(£)) = 0 for all € € g. 
Hence (iii) follows from part (ii). 
We prove part (iv). Define the bilinear map Bs : g x g > g by 


for €,7 € g. Then equation (7.4.2) can be expressed in the form 


for £,7,¢ € g. By part (i) there exists a basis of g consisting of vectors of the 
form e; = [€;,7;]. Define the linear map A; : g > g by Ase; := —B;(&, 7). 
Then Bs(e;,¢) = [Ase;,¢] for all ¢ and ¢ by (7.4.4). Hence 


Bs(€,n) = [Asé, n] = [€, Asn] (7.4.5) 


for all £,7 € g. Here the second equality holds by the skew-symmetry of Bs. 
By the Jacobi identity and equations (7.4.4) and (7.4.5) we have 


2[Bs(€,7),¢] = [I€, Asn], ¢] + [[Asé, n], ¢] 

—[I¢, €], Asn] — [[Asn, ¢], €] + [[Asé, 7], ¢] 
=Byl6 5517) = |[Balgd),) + (Agen 4) 
Ba(C&), la Belge ).6) + [Ags alec 

= [[¢, As€], m] + [In, ¢], Asé] + [[Asé, 7], ¢] = 0 


for all €,7,¢ € g. Since Z(g) = {0} by part (i), we find that Bs(€,7) =0 
for all €,7 € g, hence 6 is a derivation, and hence 6 is in the image of the 
map ad: g + Der(g) by part (ii). This proves (iv) and Lemma 7.4.3. 
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Lemma 7.4.4. The assertion of Theorem 7.4.1 holds under the additional 
assumption that the center of the Lie algebra g = Lie(G) is trivial. 


Proof. Let ¢ € Zo(G) and choose a smooth isotopy [0,1] > Zo(G) : t+ @ 
joining the identity ¢9 = id to ¢; = @. For 0 <t <1 define the diffeomor- 
phism yw: G— G by vi(g) := o:(1)~'¢:(g). Then each y; is an isometry 
and the path [0,1] + Z(G) : t > a, satisfies 


wo = id, v(1) = 1 


for all t. By Theorem 5.3.1 the derivatives UV, := dy;(1) : g > g preserve 
the Riemann curvature tensor of G at 1 and by Example 5.2.17 (for the 
standard metric on Lie subgroups of O(n)) and Exercise 5.2.21 (for general 
bi-invariant Riemannian metrics) this translates into the condition 


WrllE, 1,6] = [[WeE, Ven], VC] (7.4.6) 


for all t and all €,7,¢ € g. Hence the endomorphism 6; := wv, sy, ‘gg 
satisfies (7.4.2) for each t. Moreover, by Lemma 7.4.2 the Killing form on g 


is nondegenerate. Hence it follows from Lemma 7.4.3 that there exists a 
smooth path [0,1] > g:t+> & such that vw, ty, = ad(&) for all t. Thus 


d 
ue = Wad), Y=. (7.4.7) 
Now let [0,1] ~ G:t++ ) be the solution of the differential equation 
Pee ips (7.4.8) 
dt t = YtCt, Oo — H, a. 


and define ©; := Ad(b:) (Example 2.5.22). Then £6, = ®,ad(€;) for all t 
and ®) = Il. Thus ®; = YW and so 


Win = bynb; * (7.4.9) 


for allt and 7. Take t = 1, define b := b;, and use Lemma 5.1.10 (uniqueness 
of local isometries) to obtain 71(g) = bgb~! for all g € G. Hence 


(9) = P(U)v1(g) = d()bgb~* = agh 
for all g € G with a := ¢(1)b. This proves Lemma 7.4.4. 


We will denote by Z(G) C Z(G) the connected component of 1 in the 
center of G. Then Z(G) C G is a compact connected abelian Lie subgroup 
of G (Exercise 2.5.33). Since G is connected, its Lie algebra is the center 
of g, ie. Lie(Zo(G)) = Z(g). 
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Lemma 7.4.5. Let GC GL(n,R) be a compact connected Lie group with a 
bi-invariant Riemannian metric and let 6 € Io(G). Then 


o(hg) = o(gh) = o(g)h = hd(g) (7.4.10) 
for allg € G and all hh € Z(G). 


Proof. The proof relies on the following basic observations. 
(a) Z(G) is a compact connected Lie subgroup of G and Lie(Zo(G)) = Z(g). 


(b) exp(€ + 7) = exp(§) exp(77) for all €,7 € Z(g). 
(c) The exponential map exp : Z(g) + Zo(G) is surjective. 


(d) A := {€ € Z(g)| exp(€) = 1} is a discrete additive subgroup of Z(g) 
which spans Z(g). 


Part (a) was noted above, part (b) follows from Exercise 2.5.38 because the 
Lie algebra Z(g) is commutative, and part (c) follows from the Hopf-Rinow 
Theorem 4.6.6. That the set A is an additive subgroup of Z(g) follows 
directly from (b). Moreover, by (a) and (b) the exponential map restricts to 
a local diffeomorphism exp : Z(g) + Zo(G). Hence A is discrete and by (c) 
the exponential map descends to a proper map from Z(g)/A onto Z(G). 
Since Zo(G) is compact, the lattice A spans the vector space Z(g). 
Now assume ¢(11) = Il and define ® := d¢(1l) : g > g. Then 


o(exp(€)) = exp(®E) (7.4.11) 


for all € € g by Corollary 5.3.3 and Example 5.2.17. Moreover, ® is an 
orthogonal transformation of g that preserves the Riemann curvature tensor 
(Theorem 5.3.1). Thus |[®€, ®7]| = |[€,7]| for all €,7 € g by Example 5.2.17. 


So, if € € Z(g), then [€,n] =0 for all 7, hence [®£, ®n] = 0 for all 7, and 
hence ®€ € Z(g). This shows that 


bZ(g) = Z(g). (7.4.12) 


By (7.4.11) and (7.4.12) we have € € A if and only if ¢(exp(é)) = 1 if and 
only if exp(®€) = Lif and only if ®€ € A, so that ®A = A. Since ¢ is isotopic 
to the identity through isometries, by assumption, there exists a smooth 
path of orthogonal transformations ®; of g from ®)9 = 1 to ®; = ® that 
satisfy ®,A = A for all t. Thus ®€ = € for all € € A and so for all € € Z(g) 
because the lattice spans the subspace Z(g) by (d). Hence, by (7.4.11) 
and (c), we obtain $(h) = h for all h € Z(G) whenever ¢(1) = 1. 

To prove equation (7.4.10) in general, fix an element g € G and define the 
diffeomorphism ¢, : G > G by ¢y(h) := ¢(g) 1 (gh) for h € G. Then d, is 
an isometry and ¢,(1l) = 1. Moreover, ¢g € Zo(G), because G is connected. 
Hence ¢,(h) = h for all h € Z(G) and this proves Lemma 7.4.5. 
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Proof of Theorem 7.4.1. Let @ € Zo(G). By Lemma 7.4.5, ¢ descends to a 
diffeomorphism ¢ : G > G of the quotient group 


G := G/Z(G). 
The Lie algebra of G is the quotient space 


G:= g/Z(g) = Z(g)- 


and the invariant inner product on g restricts to an invariant inner product 
on the orthogonal complement of Z(g). This defines a bi-invariant Rie- 
mannian metric on G. The diffeomorphism ¢ is an isometry with respect 
to this metric, because the derivative dé(g) : T7G — Tyg)G is an orthog- 
onal transformation, which by (7.4.10) sends a tangent vector gn € TG 
with 7 € Z(g) to dé(g)gn = $(9)n € Tyg)G, and hence it sends the sub- 
space gZ(g)+ C TyG to 6(g)Z(g)t C Ty(g)G- Apply Lemma 7.4.4 to the 
isometry ¢ to obtain elements a,b € G whose equivalence classes a,b € G 
satisfy ¢(g) = agb~! for all g € G. Then it follows from Lemma 7.4.5 that 
#(g) =agb~! for all g € G. This proves Theorem 7.4.1. a 


Corollary 7.4.6. Let G be a compact connected Lie group equipped with 
a bi-invariant Riemannian metric. Then the map (a,b) > dq» in Theo- 
rem 7.4.1 descends to a Lie Group isomorphism 


pa: (G x G)/Z(G) > (GC). (7.4.13) 


Proof. The map G x G — Zo(G) : (a,b) + ¢a,y is a group homomorphism by 
definition, is smooth by Theorem 7.3.1, and is surjective by Theorem 7.4.1. 
Moreover, ¢g,p is the identity if and only if a = b € Z(G). Hence the map pq 
in (7.4.13) is a Lie group isomorphism, with the smooth structure on the 
quotient group (G x G)/Z(G) determined by Theorem 2.9.14. This proves 
Corollary 7.4.6. 


Example 7.4.7. Theorem 7.4.1 establishes a one-to-one correspondence 
between isometries ¢ € Zp(G) that satisfy @(1) = 1 (the case a = b) and Lie 
group automorphisms of G in the identity component. This correspondence 
does not extend to other connected components. 

For example, in the case G = T” = R"/Z” the group of automorphisms 
of T” is the infinite group Aut(T”) = GL(n,Z) of integer matrices with 
determinant +1, while the group of isometries that fix the origin is the 
finite subgroup O(n, Z) C GL(n,Z) of orthogonal integer matrices. Also, 
if G is not abelian, then the isometry G > G: g +> g7! is not a Lie group 
automorphism, but a Lie group anti-automorphism. 
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Exercise 7.4.8. Examine the case G = SU(2) (Example 2.5.20) and deduce 
that there exists a Lie group isomorphism SO(4) = (SU(2) x SU(2)) /{+1}. 


Exercise 7.4.9. Let G be a compact Lie group equipped with a bi-invariant 
Riemannian metric and let g := Lie(G). Show that the formula 


[(6,8), (6/,8°)] s= (86/ - 86, [5,07] + dad([¢,¢])) (7.4.14) 


for ¢,¢’ € g and 6,06’ € Der(g) defines a Lie bracket on g x Der(g). Show 
that the map Vectz(G) — g x Der(g) : X + (X(1), VX(1)) identifies the 
Lie algebra Vectx(G) = Lie(Z(G)) of Killing vector fields on G with the 
Lie algebra g x Der(g). Show that the homomorphism (7.4.13) induces the 
surjective Lie algebra homomorphism 


ax gg x Der(g) : (En) + (€—n, fad(E +), (7.4.15) 


whose kernel consist of all pairs (€,7) € g x g that satisfy € = 7 € Z(g). 
Show that dim(Der(g)) = dim(g) — dim(Z(g)). 


7.5 Convex Functions on Hadamard Manifolds* 


The last two sections of this book are devoted to Donaldson’s beautiful 
paper [17] in which he develops a differential geometric approach to Lie 
algebra theory. The results of [17] will be explained in reverse order. The 
first subsection examines the sphere at infinity of a Hadamard manifold M 
and contains a proof of [17, Theorem 4], which asserts that every convex 
function f : M — R that is invariant under the action of a Lie group G 
by isometries must attain its minimum whenever the G-action has no fixed 
point at infinity (§7.5.1). The next subsection deals with the special case 
of [17, Theorem 3], where M is the manifold of inner products on a vector 
space V on which a Lie group G C SL(V) acts irreducibly (§7.5.2). If G is 
the identity component of the isotropy subgroup of a nonzero vector w © W 
under a representation of the special linear group p: SL(V) > SL(W), then 
by [17, Theorem 2] there exists an inner product on V for which the Lie 
algebra g of G is symmetric (§7.5.3). This is used in [17, Theorem 1] in 
the case where V = g is a simple Lie algebra, w is the Lie bracket, and G 
is the identity component of the group of automorphisms of g, to establish 
the existence of symmetric inner products on g and deduce various standard 
results in Lie algebra theory. These applications to Lie algebra theory are 
deferred to the next section. 
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7.5.1 Convex Functions and The Sphere at Infinity 


Assume throughout that M is a Hadamard manifold, i.e. a nonempty, con- 
nected, simply connected, complete Riemannian manifold with nonpositive 
sectional curvature (Definition 6.5.1). For p € M denote the unit sphere in 
the tangent space T,,M by =: 


Sp := {vu € TM | |v| = 1}. 


Their union determines a submanifold SM := {(p,v)|p € M, v € Sp} of the 
tangent bundle, called the unit sphere bundle. 


Definition 7.5.1. Define an equivalence relation ~ on SM by 


(p,v) ~ (q, w) oy a d(exp,(tv), exp,(tw)) < oo (7.5.1) 


for (p,v),(q,w) € SM. The equivalence class of a pair (p,v) € SM will be 
denoted by [p,v| := {(q,w) € SM | (q,w) ~ (p,v)} and the quotient space 


Sxo(M) = SM/~ = {[p,»]|(p,v) € SM} 
is called the sphere at infinity of M. 


The following lemma shows that the map 5S, > S..(M):v' [p,v] is 
a homeomorphism with respect to the quotient topology on S..(M) for 
every p € M (see [17, Lemma 3}). 


Lemma 7.5.2. There exists a unique collection of maps Fyp : Sp > Sq, 
one for each pair of points p,q © M, such that 


(7.5.2) 


is exp, (exp, (Rv)) 
- R- |expg '(exp,(Rv))| 


for all p,qe€ M and all v € S,. Moreover, the convergence in (7.5.2) is 
uniform on Sp, the maps Fy are homeomorphisms, and they satisfy 


w = Fyp(v) => sup d(exp, (tv), exp,(tw)) < 00 (7.5.3) 
t>0 


for all (p,v),(q,w) € SM and 
Fig Fiup =Frp,  Fop =i (7.5.4) 


for allp,qr € M. 


7.5. CONVEX FUNCTIONS ON HADAMARD MANIFOLDS* 367 


exp,(tw(R)/R) expel) 


Figure 7.1: The sphere at infinity 


Proof. Let p,q € M and define the maps FR gp) : Sp + Sq by 


= 

expy (exp, (Rv)) 

FR qp(v) = 1 (7.5.5) 
lexpg (exp,(Rv))| 

for R > 0 and v € Sp. We claim that, for all R > d(p,q) and all v € Sp, 
0 d(p, q) 
—F < 7.5.6 
OR R,qp(v) = R(R — d(p.q)) ( ) 


To see this, fix an element v € S, and define the geodesic y : R — M 
by 7(t) := exp,(tv). Define the curve w: R + T,M by w(t) := expy'(y(t)) 
fort € R. Then Frop(v) = w(R)/|w(R)| and hence 


0 wR) (“s om) w(R) 


aR’ eae) = Toca \jwCRY fe(RYT/ Toc 


This is the orthogonal projection of the vector w(R)/|w(R)| onto the or- 

thogonal complement of w(R). Hence its length connot decrease by adding 

to it a scalar multiple of w(R), and so 

0 Z | w(R) wl) | _ CR) — w(R)/ RL 
~ |/w(R)|  Riw(R)| 


an* Rael) ] hot) 


Next we use the expanding property of the exponential map in Theorem 6.5.2 
twice, namely first the inequality for the derivative in part (ii) at the point q 
and then the inequality in part (iii) at the point 7(R) (see Figure 7.5.1). 
Define the tangent vectors vp, vq € Ty(p)M by a 


Up = —7(R) = — dexp,(w(R))w(R), 


(7.5.7) 


Uq ‘= 


=o exp, (tw(R)/R) = —dexp,(w(R))w(R)/R. 
t=R 
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Then Theorem 6.5.2 yields the estimate 
|w(R) — w(R)/R| < |dexp,(w(R))(w(R) — w(R)/R)| = up — vq). (7.5.8) 


Also, exp,p)(SUp) = exp,((R — s)v), exp, py($Uq) = exPg((R — s)w(R)/R). 
Take s = R—t and use Lemma 6.5.5 to obtain, for0 <t< R, 


d(exp,(tv), expg(tw(R)/R)) _ d(p,q) 
op — vg] < PO Pe a (7.5.9) 
Since d(p, y(R)) = R and d(q, y(R)) = |w(R)|, the triangle inequality yields 


R—d(p,q) < |w(R)| < R+d(p,q). (7.5.10) 
Combinig the inequalities (7.5.7), (7.5.8), (7.5.9), and (7.5.10) we find that 


; 15.7), (758) (759), and (7.5.10 


2 en (u)) < MR MRR — rym) - dle) 
OR |w(R)| |w(R)| ~ R(R—d(p,q)) 
This proves the estimate (7.5.6). 

It follows from (7.5.6) by integrating from a fixed number R > d(p,q) to 
infinity that the family of functions FR 4»): Sp — Sq converge uniformly to 
a function Fy, : Sp — Sq as R tends to infinity and that 


~ — d(p,q) _ R 
sup Fap(v) ~ Fraplv)! S | cerry dee € =. a) 


for all R > d(p,q). Thus the convergence is uniform and hence the limit 
function Fy, is continuous. 
Next we prove (7.5.3). Let p,q © M and v € Sp. Then 


w(R w(R 
1 Fogo) = fim, Fool) = fi. Foc = 
Here the third equality follows from (7.5.5) and the definition of the vec- 
tor w(R) = exp; '(exp,(Rv)), and the last equality follows from (7.5.10). 
Hence exp, (tw) = limp exp, (tw(R)/R), and so it follows from (7.5.9) by 
taking the limit R — oo that d(exp,(tv),exp,(tw)) < d(p,q) for all t > 0. 
This proves “ => ” in (7.5.3). The converse implication follows from the 
fact that, by Theorem 6.5.2, there can be at most one tangent vector w € Sy 
satisfying sup;s9 d(exp,(tv), exp,(tw)) < oo. Thus we have proved that the 
maps Fy, : Sp — Sq in (7.5.2) satisfy (7.5.3). That they also satisfy (7.5.4) 
follows directly from (7.5.3) and the fact that (7.5.1) defines an equivalence 
relation on SM. Hence each map Fy, is a homeomorphism with the in- 
verse Fy, and this proves Lemma 7.5.2. 
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Lemma 7.5.3. If ¢€ T(M), p,qe M, andv € Sp, then 


Fgq),6(p) ° 40(p) = dO(q) © Fa,p : Sp + Socq): (7.5.11) 


Thus the group of isometries of M acts continuously on the sphere at infinity 
via T(M) x Soo(M) + Soo(M) : (4, [p, v]) > d.[p, v] = [d(p), dbp). 


Proof. Since @ is an isometry it satisfies @ o expy = exPgvq) odd(q) for 
all g € M by Corollary 5.3.3. Hence 


do(q) (exp, (exp,(Re)) ) — EXP 5(q) (o(exp,(Re))) 
= EXP 5) (expo) (R d(p)v)) 
and so d¢(q) ° Fra» = FR,4(q),¢(p) 0 €¢(p) for all p,q eM and all R>0. 


Divide by the norm and take the limit R — co to obtain (7.5.11). This 
proves Lemma 7.5.3. 


Definition 7.5.4. Let GC GL(n,R) be a Lie group. A G-action on M 
by isometries is a Lie group homomorphism G— I(M):9+> dq, i.e. 
the map Gx M > M:(g,p) +> ¢g(p) is a smooth group action (Defini- 
tion 2.5.39) and the map ¢,: M — M is an isometry for each g €G. In 
this situation we say that the G action has a fixed point iff there exists an 
element p € M such that dg(p) = p for allg € G. We say that the G-action 
has no fixed point at infinity iff the induced G-action on the sphere at 
infinity has no fixed point, i.e. for every p € M and every v € Sy there exists 
an element g € G such that dog(p)v 4 F4, (p) p(v). 


Definition 7.5.5. A smooth function f : M — R is called convex iff the 
function foy:R— R is convex for every geodesic y: R-> M. 


With these preparations in place we are ready to state the following exis- 
tence theorem for critical points of a convex function (see [17, Theorem 4]). 


Theorem 7.5.6 (Donaldson). Let M be a Hadamard manifold equipped 
with a smooth action G+ ZI(M): g++ ¢q of a Lie group G by isometries, 
let K be a compact subgroup of G, and let f : M > R be a convex function 
such that fod, =f for allg © G. Assume that the G-action has no fixed 
point at infinity. Then there exists an element po © M such that 


f(po) = inf f(p), du(po) =po for allucK. (7.5.12) 
pEeM 


Proof. See page 372. 
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As pointed out in [17], similar results can be found in the works of 
Bishop-O’Neill [7] and Bridson—Haefliger [11, Lemma 8.26]. We remark 
that the compactness of the subgroup K C G is only needed for an appeal to 
Cartan’s Fixed Point Theorem 6.5.6. If we assume instead that the action 
of K on M has a fixed point, compactness is not required. The proof of 
Theorem 7.5.6 is based in the following lemma (see [17, Lemma 5)). 


Lemma 7.5.7. Let p; be a sequence in M such that limj_.. d(p, pi) = 00 
for some (and hence every) p € M. Let IT Cc T(M) be a collection of 
isometries of M such that sup, d(p;,¢(pi)) < co for allée TZ. Then the 
isometries in IZ have a common fixed point at infinity, i.e. there exists an 
element [p,v] € Soo(M) such that ¢,[p,v] = [p, v] for all ¢ € T. 


Proof. Fix an element p € M and define 


-1 
exp, (pi) _ 

Ui= ——— Rj := |exp; '(pi)| = d(p, pi), (7.5.13) 
for each i € N such that p; 4 p. Passing to a subsequence, if necessary, we 
may assume that p; # p for alli € N and that the limit v := limj4.. ui € Sp 
exists. We will prove that d¢(p)v = Fyyp)p(v) for all ¢ € Z. To see this, 
let @ € Z, choose c > 0 such that d(p;, 6(pi)) < c for all 7, and define 


=" 
expy (9(pi)) Z 
Uj, = ep Rj == lexp,' (¢(pi))| = d(p, op). 
Since exp,(Rjv;) = $(p;) and exp,(Rjvi) = pi, it follows from Theorem 6.5.2 
that |Riv; — Rivt| < d(pi,¢(pi)) < c for all i. Since |R; — Ri| < c by the 
triangle inequality, this implies |v; — v;| < 2c/R; and hence lim;_,.. Ri = 00 
and lim; jog , = lity seg 1% = 0. Since exp, (Rjv;) = $(p;), this implies 


— sien &PT (Cd) 
- -1 t,t ee a ey mee 
re lexpy (expp(Riu))] > lexpy "(6(P9) 


for allg € M. Take q = ¢(p) and use the identity EXP 5p) of = d¢(p) cexp;* 
and equation (7.5.13) to obtain 


_. do(p)expy'(pi) es 


Hence ¢,[p,v] = [p, v] for all ¢ € Z and this proves Lemma 7.5.7. 
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The next step in the proof of Theorem 7.5.6 is to examine the gradient 
flow of the convex function f : M— R. The flow equation has the form 


V(t) = -VF(V@)), (7.5.14) 


where the gradient vector field is defined by (Vf (p), v) := df(p)v for p € M 
and v € T,M. An important consequence of convexity is that the distance 
between any two solutions of the gradient flow equation is nonincreasing. 
This is the content of the next lemma (see [17, Lemma 6]). 


Lemma 7.5.8. Let f: M—R be a smooth function. Then f is convex if 
and only if it satisfies the condition 


(WV f(p),v) = 0 (7.5.15) 


for allp€ M and allv €T,M. If f is convex, then the following holds. 
(i) Equation (7.5.14) has a solution y: |[0,00) + M on the entire positive 


real axis for every initial condition 7(0) = po. 
(ii) Let yo : (0,00) > M and 1: [0,co) > M be two solutions of (7.5.14). 


Then the function [0,co) > R: t+ d(t) := d(yo(t), y1(t)) is nonincreasing. 
Proof. Let y: R- M be a geodesic. Then fo7y:R-— R is convex if and 
only if 
a? d ; ‘ 
0 < Saf) = FV FOM), IO) = YwmVFOM), 1) 
for all t. This holds for all geodesics if and only if f satisfies (7.5.15). In the 


remainder of the proof we assume that f is convex. 
Let po € M and for T > 0 define 


Kr := {p € M|d(po,p) < T|VF (po) |} - 


This set is compact because M is complete. Now let 7: [0,7) -— M be the 
solution of (7.5.14) with (0) = po on some time interval [0,7 ). Then 


£ WA)? =2 (VF), VEC) = -2 (GVF(),4) <0 


by (7.5.15). Thus the function t + |Vf(y(t))| = |7()| is nonincreasing and 
so y(t) € Kr for 0 < t < T. Since Kr is a compact subset of M, the 
solution y extends to a longer time interval [0,7'+ 6) for some 6 > 0 by 


Corollary 2.4.15. Since T > 0 was chosen arbitrary, this proves (i). 
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We prove part (ii). Assume without loss of generality that yo(0) 4 y1(0) 
and so yo(t) # y(t) for all t. For t > 0 let [0,1] > M: 5s (s,t) 
be the unique geodesic that satisfies y(0,t) = yo(t) and y(1,t) = y(t). 
Then d(t) = d(yo(t), y1(t)) = L(y(-, t)) = |Osy(s, t)| for all s,t and hence 


.. _ ft (ds7(s,t), VOsy(s, t)) 1 
a | ante) -aha 
: (Os7( 


= ap (0010.8). VF, t))) — (Oxy 


(0s7(8,t), 7(s, t)) ds 


Os 
0,8), VF((0,)))) 


1 


1 
7 8, (Os7(8,t), Vasy(s.t) f(1(s,t))) ds < 0 


by (7.5.15). This proves (ii) and Lemma 7.5.8. 


Lemma 7.5.9. Let f: M—->R be a convex function that has a critical 
point Po. Then f(p) > f (Poo) =: ¢ for all p € M, and the set Cy := f~*(c) 
of minima of f is geodesically convex. 


Proof. Let p € M and let 7: [0,1] > M be the unique geodesic with the 
endpoints 7(0) = po to y(1) =p. Then 6 := foy: [0,1] > R is a convex 
function satisfying 3(0) = f(poo) = c and 6(0) = 0, hence A(t) > c for all t, 
and so f(p) = 6(1) >c. Thus f attains its minimum at po. 

Now let po,p1 € Cy and let y : [0,1] + M be the unique geodesic with the 
endpoints 7(0) = po and y(1) = p;. Then the function 6 := fo7y: [0,1] >R 
is convex, satisfies 6(0) = 3(1) =c, and takes values in the interval [c, co). 
Hence 6 = c and so y(t) € Cy for all t. This proves Lemma 7.5.9. 


Proof of Theorem 7.5.6. Choose an element po € M and let y : [0,00) > MW 
be the unique solution of equation (7.5.14) that satisfies the initial condi- 
tion y(0) = po. Consider an arbitrary sequence t; — oo. Assume first that 


sup d(po, y(t) = 00 (7.5.16) 


and choose a sequence t; co such that limj+.o d(po, y(ti)) = co. Now 
let g€G. Since ¢,: M— M is an isometry and fod, =f, it follows 
that ddg(p)V f(p) = Vf(¢g(p)) for all p € M (Exercise: Prove this.) Hence 
the curve [0,00) + M:t+> ¢g(7(t)) is another solution of equation (7.5.14) 
and hence d(y(ti), dg(y(ti))) < d(po, dg(po)) for all i and all g € G by part (ii) 
of Lemma 7.5.8. Hence Lemma 7.5.7 asserts that there exists a [p,v] €© SM 
such that dé,g(p)v = Fg, (p),p(v) for all g € G, in contracdiction to our as- 
sumption that the G-action has no fixed point at infinity. 
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This shows that our assumption (7.5.16) must have been wrong. Thus 
sup d(po, V(t) =: R < 00, 
t>0 


and so our solution y : [0,0co) + M of (7.5.14) takes values in the compact 
set B := {p€ M|d(po,p) < R}. Since the function t 4 f(y(t)) is non- 
increasing, this implies that the limit 


e= Jim f(y(t)) = min f(p) (7.5.17) 


exists and is a real number (and not —oo). Since $f (7(t)) = —|Vf(7(t))|? 
and the function t+ f(7(t)) is bounded below by c, there must exist a 
sequence t; — co such that 

lim Vf (y(é;)) = 0. (7.5.18) 

woo 
Since y(t;) € B for all 7, we may also assume that the limit 

Pog t= met) (7.5.19) 
1-00 

exists (after passing to a subsequence, if necessary). This limit is a critical 
point of f by (7.5.18), and f(p..) = c by (7.5.17). Hence, by Lemma 7.5.9, f 
attains its minimum at po and the set Cy := {p € M| f(p) = c} of minima 
of f is geodesically convex. We must find an element of Cy that is fixed 
under the action of K. By Cartan’s Fixed Point Theorem 6.5.6 there exists 
aqé€WM such that ¢,(q) = q for all u€ K. Since M is complete and Cy is 
a nonempty closed subset of M, there exists an element po € Cy such that 


d(q, po) = = d(q,p) =: 6. (7.5.20) 


We claim that ¢.(po) = po for all u € K. To see this, fix an element u € K, 
let y: [0,1] + M be the geodesic joining 7(0) = po to y(1) = ¢u(po), and 
denote by m := (1/2) the midpoint of this geodesic. Then Lemma 6.5.7 
asserts that —— 


d(po, du(po))? 


2 < d(q, po)” + 44, bu(Po))”:- (7.5.21) 


2d(q,m)? + 
Since ¢u(q) = q and ¢, is an isometry, we have d(q, du(po)) = d(q, po) = 4, 
and since Cy is geodesically convex, we have m € Cy and so d(q,m) > 6. 
Hence it follows from (7.5.21) that po = ¢u(po). This shows that po € Cf is 
a fixed point for the action of K on M and proves Theoren 7.5.6. 
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Example 7.5.10. To illustrate the argument in the proof of Theorem 7.5.6, 
take M =C with the standard flat metric. Then the equivalence relation 
on the sphere bundle SM = C x S' is given by translation in C and so the 
sphere at infinity is $.,(M) = $'. The orthogonal group G = O(2) acts by 
isometries on M and has no fixed point at infinity. The subgroup K = Z/2Z 
acts by complex conjugation and its fixed point set is the real axis. Choose 
a smooth convex function h : R — R that vanishes on the interval [—1, 1] 
and is positive elsewhere. Then the function f(z) := h(|z|) is convex and G- 
invariant and the set Cy of minima of f is the closed unit disc in C. If q = 2 
is the fixed point of the K-action chosen in the proof, then po = 1 € Cy. 
If G=K = Z/2Z, then +1 are the fixed points at infinity and f(z) := eRe) 
is convex and G-invariant, but does not take on its infimum. 


Example 7.5.11 ([17]). Consider the case where M = D” is the Poincaré 
model of hyperbolic space (Exercise 6.4.22). Then the sphere at infinity 
is the boundary 0D” = S™~! (Exercise 6.4.24). If the convex function f 
extends continuously to the closed ball and does not take on its minimum 
in M, then it attains its minimum at a unique point on the boundary, 
because any two boundary points are the asymptotic limits of a geodesic 
in M. Hence the minimum on the boundary is fixed under the action of any 
Lie group on M by isometries, that leave f invariant. This is reminiscent of 
the Kempf Uniqueness Theorem in GIT (see [37] and [20, Theorem 10.2]), 
where M = G/K is associated to the complexification G of a compact Lie 
group K and f is the Kempf—Ness function (see [38, 53] and [20, §4]). 


7.5.2 Inner Products and Weighted Flags 


We will now turn to a specific example, where the Hadamard manifold is the 
space of positive definite symmetric matrices with determinant one (§6.5.3). 
Following [17], we choose a finite-dimensional real vector space V equipped 
with a fixed inner product (-,-). Then every inner product on V has the 
form (v,v')p := (v,P~'v’) for some self-adjoint positive definite automor- 
phism P. Denote the set of such automorphisms with determinant one by 


PV) := {P € End(V) | P* = P > 0, det(P) = 1}. (7.5.22) 


Here P* € End(V) is defined by (v, P*v’) := (Pv, v’) for v,v’ € V, and the 
notation “P > 0” means (v, Pv) > 0 for all ve V \ {0}. Thus A(V) isa 
codimension-1 submanifold of the space of self-adjoint endomorphisms of V. 
Its tangent space at P € Ao(V) is given by 


TpPo(V) := {P € End(V)| P = P*, trace(PP~) = 0}. (7.5.23) 
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The Riemannian metric on Yo(V) is defined by 


|Plp:= \/trace(PP-!PP-!) (7.5.24) 


for P € A(V) and P€ TpPo(V) as in §6.5.3, and so Ao(V) is a Hadamard 
manifold by Theorem 6.5.9. For P € Tp Yo(V) the endomorphism PP~! is 
self-adjoint with respect to the inner product (-,P~!-) on V. 

The group SL(V) C GL(V) of automorphisms of V with determinant 
one acts on #o(V) by the isometries ¢,(P) := gPg* for g © SL(V). The 
isotropy subgroup of 1 € Ao(V) is the special orthogonal group SO(V). 
The action of a subgroup G C SL(V) on V is called irreducible iff there 
does not exist a linear subspace E CV, other than FE = {0} and EF=V, 
such that gE = E for all g © G. This notion can be used to carry over the 
general existence theorem in §7.5.1 for critical points of a convex function 
to the present setting (see [17, Theorem 3}). 


Theorem 7.5.12 (Donaldson). Let G C SL(V) be a Lie subgroup such 
that the action of G on V is irreducible. Let K C G be a compact subgroup 
and let f : Ay(V) > R be a convex function such that f(gPg*) = f(P) for 
allg € G and all PE AV). Then there exists a Py € Ao(V) such that 


Fo) = . int P), Pou* = P Lete KK, 7.5.25 
f(Po) = , int FP), uFyu" = Py for all u (7.5.25) 


Proof. See page 378. 


The goal will be to deduce Theorem 7.5.12 from Theorem 7.5.6. Thus 
we must understand the sphere at infinity of the space Yo(V). This will be 
accomplished with the help of the following definition. 


Definition 7.5.13. A weighted flag in V is a pair (F,), where F is a 
finite sequence of linear suspaces 


{=F chCHc-CK=V 


such that n; := dim(F;)/dim(Fj;_1) > 0 fori=1,...,r, and p is a finite 
sequence of real numbers p14 > 2 >-+: > py satisfying the conditions 


r ie 
Soni =0, So nip, = 1. (7.5.26) 
=1 i=1 


Let F = F(V) be the set of weighted flags. The group SL(V) acts on F(V) 
by 9: (Fy) = (9Fi, mids for (Fy) = (Fi, midi € F(V) and g € SL(V). 
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For P € Ao(V) the unit sphere in the tangent space Tp Yo(V) is the set 
p= {P = P* € End(V) | trace(PP~*) =), trace(PP-!PP-) =}, 


Let P € A(V) and P € Yp. Then the endomorphism PP-lis self-adjoint 
with respect to the inner product (-, P~1!-) and hence has only real eigenval- 
ues {1 > M2 >+++ > Ur. For each 2 let E; C V be the eigenspace for ju; and 
define n; := dim(£;). Since trace(PP—) =() and-trace( PP PP") = 1, 
the p;,7; satisfy (7.5.26). The weighted flag of (P, P) is defined by 


ip(P) = (Fy py) = ((Fi, 441), atti (F,, Hr) EF (Vv), (7.5.27) 


where F; := EF, ®@ Eo @---@ E; fori=1,...,r. For each P€ AV) the 
map up : Yp — F(V) defined by (7.5.27) is bijective, and thus induces 
a (compact, metrizable) topology on F(V). This topology is independent 
of P as the next lemma shows. The lemma also shows that the space of 
weighted flags is the sphere at infinity (see [L7, Lemma 4]). 


Lemma 7.5.14. The equivalence relation in Definition 7.5.1 on the unit 
sphere bundle SAo(V) is given by 


(P,P)~(Q,Q) <= — up(P) = t(Q) (7.5.28) 


for P,Q € AV), P&.Yp, and Q € Lg. Thus the map Fo,p: /p > -%q 
as defined in Lemma 7.5.2 is given by Fo,p ='g lp’. Moreover, the 
map SPo(V) > F(V): (P,P) tp(P) is SL(V)-equivariant, i.e. 


tgPgt (gPg") = 9 -up(P) (7.5.29) 
for all P € Py(V), all P € Yp, and all g € SL(V). 


Proof. Since (gPg*)(gP9*)—* = g(PP-!)g7} for all P € .Yp and g €SL(V), 
equation (7.5.29) follows directly from the definitions. The proof that the 
equivalence relation satisfies (7.5.28) rests on the following claims. 

Claim 1. Let S¢.%, let (Fj, wi)'_, = a(S) be the flag associated to S, 
and let h € SL(V) be an automorphism of V with determinant one such that 


hia = Fy ie 2 ers (7.5.30) 
Then (1, S) ~ (hh*, RSA"). 


Claim 2. Let g © SL(V) and fiz any flag {0} = PO CMC::-CR=V. 
Then there exist elements hE SL(V) and u€ SO(V) such that h satis- 
fies (7.5.30) and g = hu. 
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We prove, that these two claims imply (7.5.28). Fix any element S € .%, 
let (Fi, wi)}_1 = ¢1(S) be the weighted flag of S, and let P € Ao(V). Choose 
an element g € SL(V) such that gg* = P, and choose u and fh as in Claim 2. 
Then hh* = P, and the pairs (P,P) := (hh*,hSh*) and (1,9) have the 
same flag by (7.5.30) and are equivalent by Claim 1. Hence any pair (P, P) 
is equivalent to (1, S) if and only if it has the same flag. By transitivity of 
the equivalence relation we deduce that (7.5.28) holds for all P,Q € A(V). 

We prove Claim 2. Let F! := g~1F;, and m; := dim(F) for i = 1,...,r. 
Then choose orthonormal bases €1,...,€m and e},...,e,, of V such that for, 
each i, the vectors €1,...,€m,; form a basis of F; and the vectors e4,..., ein, 
form a basis of FY. Define the orthogonal transformation u by ue}, := e; 
for i=1,...,m. It satisfies F! = u-'F; and hence gu~!F; = F; for all i. 
Thus h := gu7! satisfies the requirements of Claim 2. 

We prove Claim 1, following [17, Lemma 4]. Define the geodesics yo, 1 
in Ao(V) by yo(t) = exp(tS) and 71 (t) = hexp(tS)h* (see Lemma 6.5.17). 
By equation (6.5.11) the square of their distance is given by 


p(t) := d(exp(tS), hexp(tS)h*)? = trace ((log (M(t) M(t)"))”) (7.5.31) 
M(t) := exp(—t$/2)hexp(tS/2). 


In the eigenspace decomposition V = EF, @--- @ E, of the self-adjoint endo- 
morphism S the automorphisms h and exp(tS/2) have the form 


hit hig +++ Map 
_ 0 hag . 
: . fin. die (7.5.32) 
O -) 0 App 


exp(t$/2) = diag(e'"/* Ip, , et2/7 Ip... er /2 Tn). 


Here the upper triangular form of h follows from (7.5.30). Hence 


hit Ayo(t) «++ — hir(t) 
M(t) = 4 hop (7.5.33) 
ff he Bea) 
0 ve 0 Ppp 


where hj;(t) := ei #9)/2h;; for 1< i<j <r. Since pj > py for i<j, it 
follows that the limit M. := limy4.. M(t) = diag(hi,...,hrr) exists and 
is an invertible endomorphism of V. Hence the function p: [0,00) > R 
in (7.5.31) is bounded. This proves Claim 1 and Lemma 7.5.14. 
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Proof of Theorem 7.5.12. Let GC SL(V) be a Lie subgroup which acts ir- 
reducibly on V. Then G acts on Yo(V) by isometries. By Lemma 7.5.14 
the induced action on the sphere at infinity S..(Wo(V)) = F(V) is given 
by Gx F(V) > F(V) : (9, (Fy, Made=s,...r) (9s, Ma)é=i,...,r- This action 
has no fixed points because the action of G on V is irreducible and r > 2 
for each weighted flag (Fi, Mi)i=1,.... € A(V). Hence all the assertions of 
Theorem 7.5.12 follow directly from Theorem 7.5.6 with M = A(V). 


7.5.3 Lengths of Vectors 


The material in this section goes back to ideas in geometric invariant theory 
developed by Kempf-—Ness [38], Ness [53], and Kirwan [40] in the complex 
setting and by Richardson-Slodowy [59] and Marian [48] in the real setting. 
We assume throughout that V,W are finite-dimensional real vector spaces 
and p: SL(V) — SL(W) is a Lie group homomorphism. Note that every 
Lie group homomorphism from GL(V) to GL(W) restricts to a Lie group 
homomorphism from SL(V) to SL(W), because every Lie group homomor- 
phism from GL(V) to the multiplicative group of nonzero real numbers is 
some power of the determinant. 

Fix a nonzero vector w € W and denote by Gy C SL(V) the connected 
component of the identity in the isotropy subgroup of w, i.e. 


4a smooth path ¥: [0,1] > SL(V) 
Gw := 4g € SL(V) | such that 7(0) = 1, y(1) = g, and . (7.5.34) 
p(y(t))w = w forO<t<1 


By Theorem 2.5.26 this is a Lie subgroup of SL(V) with the Lie algebra 


dw = {€ € sl(V) | A(€)w =O}. 
There are many examples of this setup that are related to interesting ques- 
tions in geometry. The vector space W can be the space of all symmetric 
bilinear forms on V and w can be an inner product, in which case Gy is 
the special orthogonal group associated to the inner product, or w can be 
the quadratic form (6.4.9), in which case G, is the identity component of 
the isometry group of hyperbolic space. Or W can be the space of skew- 
symmetric bilinear forms on V and w a symplectic form, in which case G,, 
is the symplectic linear group. Or W can be the space of skew-symmetric 
bilinear maps on V with values in V. Then w can be a cross product in 
dimension three or seven, or w can be the Lie bracket of a Lie algebra g = V 
and then Gy is the identity component in the group of automorphisms of g. 
The latter example will be examined in detail in §7.6.2. Of particular inter- 
est are the cases where the group Gy is noncompact. 
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Definition 7.5.15. An inner product (-,-) on V is called (p, w)-symmetric 
iff the Lie subalgebra gw C sl(V) is invariant under the involution A> A*, 
defined by (v, A*v’) := (Av,v') for v,v' EV. 


Exercise 7.5.16. Let (-,-) be a (p, w)-symmetric inner product on V. Prove 
that g € Gy implies g* € G,,. Hint: Choose a smooth path g: [0,1] > Gy 
with the endpoints g(0) = 1 and g(1) =g, and define €(t) := g(t)~'g(t). 
Show that the initial value problem h(t) = €(t)*h(t), h(0) = 1, has a unique 
solution h : [0,1] > Gy (Exercise 2.5.35) and that h(t) = g(t)* for all t. 


The following theorem asserts the existence of a (p, w)-symmetric inner 
product on V under an irreducibility assumption (see [17, Theorem 2]). 


Theorem 7.5.17. Assume that the group Gw in (7.5.34) acts irreducibly 


on V. Then there exists a (p,w)-symmetric inner product on V with the 
following properties. The subgroup 


Ky := Gy MSO(V) 


is connected and is a maximal compact subgroup of Gy. Moreover, every 
compact subgroup of Gw is conjugate in Gy to a Lie subgroup of Ky. Thus, 
if K is any maximal compact subgroup of Gy, there exists an element h € Gy 
such that K = hK,h-!. 


Proof. See Lemma 7.5.23 on page 383. 


Example 7.5.18. The hypothesis that the group G, acts irreducibly on V 
cannot be removed in Theorem 7.5.17. Consider the case where W = V 
has dimension at least two, the homomorphism p: SL(V) — SL(V) is the 
identity, and w € V is any nonzero vector. Then the one-dimensional linear 
subspace Rw C V is evidently invariant under the action of G,, and there 
does not exist any (~, w)-symmetric inner product on V. 


To begin with, we will fix any inner product (-,-)y on V and define 
the space Yo(V) of self-adjoint positive definite automorphisms of V with 
determinant one in terms of this fixed inner product. We will then use 
Theorem 7.5.12 to find an element P € Yo(V) such that the inner product 


(0,0) yp i= CHP) (7.5.35) 


on V satisfies the requirements of Theorem 7.5.17. The proof is based on 
three lemmas. The fourth lemma restates the theorem in a modified form. 
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Lemma 7.5.19. There exists an inner product (-,-)w on W such that 
p(SO(V)) C SO(W) and p(A*) = p(A)* for all A € sl(V). 

Proof. The proof follows the argument in [17, §3]._ Assume without loss 
of generality that V = R™ is equipped with the standard inner product, 
and that W = R” and that p: SL(m,R) > SL(n,R) is a Lie group homo- 
morphism. We prove first that there exists a unique Lie group homomor- 
phism p° : SL(m, C) — SL(n, C) (the complexification of p) such that 


P* lsum) = P P(A + iB) = p(A) + ip(B) (7.5.36) 
for all A,B € sl(m,R). Since SL(m,C) is connected, we can define p°(g) 
for g € GL(m,C) by choosing a smoth path a: [0,1] > SL(m,C) with the 
endpoints a(0) = Im and a(1) = g, and taking 


p°(g) := 81), —- B(s)“*B(s) = 6*(a(s)*a(s)),  B(0) = In. (7.5.37) 


To verify that 6(1) is independent of the choice of a, one can choose a path of 
paths [0, 1]? > SL(m,C) : (s,t) + a(s,t) satisfying a(0,t) = 1, a(1,t) =g, 
define S := a7!0,a and T := a !Qa, and define @: [0,1]?  SL(n,C) 
as the solution of the initial value problem 6~'0,8 = p°(S), 6(0,t) = Mn. 
Since p© is a Lie algebra homomorphism and 0,5 — 0,T = [S,T], it follows 
that 6~'d,8 = p°(T) and so @(1,t) is independent of t. Moreover, SL(m, C) 
retracts onto SU(m) by polar decomposition and so is simply connected by 
a standard homotopy argument. That the map p° : SL(m,C) > SL(n, .C) 
thus defined is smooth follows from the smooth dependence of solutions on 
the parameter in a smooth family of differential equations. That it is a group 
homomorphism follows by catenation of paths, and that it satisfies (7.5.36) 
follows directly from the definition. 

Now consider the action of the compact subgroup SU(m) C SL(m, C) on 
the Hadamard manifold 2p of positive definite Hermitian n x n-matrices Q 
of determinant one (Remark 6.5.20) by (g,Q) — p*(g)Qp°(g)*. This action 
is by isometries and hence, by Cartan’s Fixed Point Theorem 6.5.6, there 
exists an element Qo € Yo such that p°(g)Qop°(g)* = Qo for all g € SU(m). 
Differentiate this equation at g = 1,, to obtain p°(B)Qo + Qop*(B)* = 0 for 
every skew-Hermitian matrix B = —B* € sl(m,C) with trace zero. Now 
let A € sl(m,R), define R:= $(A— A‘), S:=4(A+A"), and take B=R 
and B =iS. Then p(R)Qo + Qop(R)' = 0 and £(S)Qo = Qop(S)'. Hence 
the positive definite symmetric n x n-matrix Q := Re(Qo) satisfies 


p(A') = p(-R+ S) = QA(R+ S)"Q* = Qp(A)'QT* 
for all A € s{(m,R). This shows that the inner product (w, w’) := w'Q7!w! 
on R” satisfies the requirements of Lemma 7.5.19. 
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In the remainder of this subsection we will fix an inner product on W 
as in Lemma 7.5.19. Recall also that we have already chosen a nonzero 
vector w € W. The norm squared of this vector determines a function on 
the space of inner products on V (see [17, Lemma 1)). 


Lemma 7.5.20. Define the function fw: Ao(V) > R by 
fu(P) = (w, p(P“)w) yy (7.5.38) 


for PE Al(V). Then an element P € Ao(V) is a critical point of fu if 
and only if (p(A)w, p(P~!)w)w =0 for all A € s(V). Moreover, if P is a 
critical point of fw, then the inner product (7.5.35) on V is (p, w)-symmetric. 


Proof. Fix an element P € Yo(V) and a tangent vector P € Tp o(V ): 
Then it follows from Lemma 7.5.19 that 


dfu(P)P = —(w, p(P-'P)o(P- yw) 
=— (A(PP™)w, (Pw) 
Now let A € sl(V) and take P := AP + PA* to obtain 
dfw(P)(AP + PA*) = —(p(A+ PA*P™')w, p(P™')w) 
= —2(6(A)w, p(P~"')w),y,- 


Thus P is a critical point of f, if and only if the right hand side of (7.5.39) 
vanishes for all A € sl(V). To prove the last assertion, define the norm 


jw'lw.p = V(w', p/P )w!)w 


for w’ € W. Now let P € Ao(g) be a critical point of fw, let € € sl(V), and 
take A := [€, Pé*P~1] € sI(V) in (7.5.39). Then 


0 = (A([E, PEP "})w, p(P~')w) y, = |A(PEP")wliv.p — |A(Q\wlive- 


If € € gw, then p(€)w = 0, hence p(PE*P~!)w = 0, and hence PE*P~! € gy. 
But the endomorphism Pé*P~! is the adjoint of € with respect to the inner 
product (7.5.35) on V and this proves Lemma 7.5.20. 


a 


a (7.5.39) 


Example 7.5.21. This example shows that the (p,w)-symmetry of the 
inner product (7.5.35) does not imply that P is a critical point of fy. 
Take W = V x V and let p: SL(V) > SL(W) be the diagonal action. As- 
sume dim(V) = 2 and choose w = (u,v) € W such that u,v € V are linearly 
independent. Then G, = {1} and so every inner product on V is (p, w)- 
symmetric (and the assertions of Theorem 7.5.17 are satisfied), however, the 
function f,,(P) = (u, P~tu)y +(v, P7'v)y does not have any critical point. 
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Let My C Ao(V) be the set of minima of the function fy, : Ao(V) > R 
in Lemma 7.5.20 and let Crit(fw) C Yo(V) be its set of critical points. The 
next result shows that fy, is convex and that Gy acts transitively on M,, 
whenever this set is nonempty (see [17, Lemma 2]). 


Lemma 7.5.22. The function fy has the following properties. 

(i) fw is conver and Gy-invariant, and thus My = Crit(fw). 

(ii) Fir two elements Po € My and P € Aop(V). Then P € My if and only if 
there exists an element n © gy such that n = Pon* Py" and exp(n) = PY, 
or equivalently p(PP,')w = w. 

(iii) The group Gw acts transitively on My. 

Proof. We prove part (i). Let y: R-> Ao(V) be a geodesic and define 


n 


P := (0), P :=¥(0), S:= P-V2pp V2, 
Then, by Lemma 6.5.17, y(t) = P!/* exp(tS)P!/2 and hence 


fu((t)) = (p(P~"?)w, exp(—ta($))o(P?)w)w. 
This implies 


2 
Sr hullt)) = (A(S)p(P~"/?)w, exp(—ta(S))A(S)o(P-")w)w > 0 


for all t and hence f,, is convex. Hence My = Crit(fw) by Lemma 7.5.9. 
That f, is Gw-invariant follows directly from the definition. This proves (i). 
We prove part (ii). If 7 € gw satisfies 7 = Pon*P)', exp(n) = PP)’, 
then p(PP)‘)w = w. If p(PP)')w = wv, then p(P~!)w = p(P5')w, hence 
(o(A)w, eP)w)w = (A(A)w, pCPS2)u)w = 0 
for all A € sl(V), and hence P € M,, by Lemma 7.5.20 and part (i). Now 
assume P € M,, and let 7: [0,1] > Ao(V) be the unique geodesic with the 
endpoints 7(0) = Po and 7(1) = Pe M,. Then y(t) € M, for all t by 
Lemma 7.5.9. Hence (6(7)w, p(y(t)~!)w)w = 0 for all t and all 7 € sl(V), 
by Lemma (7.5.20). Differentiate this equation at t = 0 to obtain 


(a(n)w, p(Py')(PPy')w)w =0, — P:= 4(0) € Tr, Po(V). 

Take 7 := Ppyt to obtain p(n)w = 0, and thus 7 € gw and 1 = Pon*Py". 

By Lemma 6.5.17 we also have P = y(1) = exp(PP5') Po = exp(7)Po and 
this proves (ii). 

We prove part (iii). Let Po, P € My, choose an element 7 € gy as in (ii) 

so that 7 Py) = Pon* and P = exp(n) Po, and define h := exp(n/2) € Gw to 

obtain P = hPoh*. This proves (iii) and Lemma 7.5.22. 
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With these preparations we are ready to prove Theorem 7.5.17. 


Lemma 7.5.23. (i) If Gy acts irreducibly on V, then My 4 9. 
(ii) If P € My, then the inner product (-,P~'-\y on V_ satisfies all the 
requirements of Theorem 7.5.17. 


Proof. The function fw is convex and G,-invariant by Lemma 7.5.22. Hence 
part (i) follows from Theorem 7.5.12. To prove part (ii), assume that P is 
a critical point of f,. Then, by Lemma 7.5.20, the inner product (-, P~'-)y 
is (p, w)-symmetric. We prove the remaining assertions in four steps. Define 
the group Kp := Gy NSO(V, (-, P1-)y) = {u € Gy |uPutP = I}. 

Step 1. Let KC Gy be any compact subgroup. Then there exists an ele- 
ment h € Gy such that h~'Kh c Kp. 


By Theorem 7.5.12, there exists a Py € Yo(V) such that 
fw(Po) = fw(P) 


= i Py. Pou* = Po for all u € K. 
pea ) uPou » for all u 
By Lemma 7.5.22, there exists an element h € G, such that Py9 = hPh*. 
Hence uhPh*u* = hPh* for all u € K, and hence the automorphism h~!uh 
is orthogonal with respect to the inner product (-,P~!-)y for every u € K. 


Step 2. Kp is a compact connected subgroup of Gy. 


By the Closed Subgroup Theorem 2.5.26 Kp is a closed, and hence compact, 
subgroup of SO(V, (-, P~!-)y) and so is a compact Lie subgroup of G,,. We 
prove that Kp is connected. Let ue Kp C Gy. Since Gy is connected, 
there exists a smooth path g : [0,1] >~ Gy such that g(0) = land g(1) =u. 
Thus, by Exercise 7.5.16, we have g(t)Pg(t)*P~! € Gy for all t. Hence, by 
part (ii) of Lemma 7.5.22, there exists a smooth path 7 : [0,1] > gw such 
that 7(t) = Pn(t)*P', exp(n(t)) = g(t)Pg(t)*P-!, and (0) = n(1) =0. 
Hence u(t) := exp(—7(t)/2)g(t) is a path in Kp joining u(0) = 1 to u(1) = u. 
Step 3. Kp is a maximal compact subgroup of Gy. 


Let K C Gy be a compact subgroup containing Kp. Then by Step 1 there 
exists an h € G, such that h~!Kh C Kp. Hence Kp C K C hKph~! and 
so Lie(Kp) C Lie(K) C hLie(Kp)h!. Since Lie(Kp) and hLie(Kp)h~! 
have the same dimension, this implies Lie(Kp) = hLie(Kp)h~!. Since Kp 
and hK ph! are connected, this implies Kp = hKph7! and so Kp = K. 
Step 4. Let K be a maximal compact subgroup of Gw. Then there exists an 
element h € Gy such that K = hKph"!. 

By Step 1 there exists an h € G,, such that h~'Kh C Kp, thus K C hKph7! 
and so K = hKph™!, because K is a maximal compact subgroup of G,,. This 
proves Step 4, Lemma 7.5.23, and Theorem 7.5.17. 
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Lemma 7.5.23 shows that all minima of the function f,, in Lemma 7.5.20 
give rise to inner products that satisfy the requirements of Theorem 7.5.17. 
The next lemma shows that the set of minima of f,, is a Hadamard manifold. 


Lemma 7.5.24. The set My is a geodesically convex and totally geodesic 
submanifold of Ao(V). Hence, if it is nonempty, it is a Hadamard manifold 
and a symmetric space. 


Proof. By Lemma 7.5.22 the function fy is convex, and so My is geodesically 
convex by Lemma 7.5.9. To prove that My is a totally geodesic submanifold 
of Ao(V) (and hence a Hadamard manifold), note that the exponential map 


Tp Po(V) > Ay(V) : Pwo exp(PP) ')Po 


is a diffeomorphism (Theorem 6.5.9 and Lemma 6.5.17), that My is the 
image of the linear subspace {P € Tp, Po(g) | PPy Y © gy} under this diffeo- 
morphism (Lemma (7.5.22)), and that Pp can be chosen to be any element 
of My. Moreover, the isometry o(V) > Ao(V) : P+ d0(P) = PoP 'Po 
in Step 2 of the proof of Theorem 6.5.9 satisfies 


~o (exp(PP,') Po) a exp(—PP;!) Po 


for all P € TpAp(V) and so restricts to an isometry of M,,. Hence My is a 
symmetric space and this proves Lemma 7.5.24. 


We emphasize that Lemma 7.5.24 does not require the hypothesis that 
the group Gy acts irreducibly on V. This hypothesis was only used to prove 
that the space M, is nonempty. The next example shows that f,, can have 
critical points in cases where Gy acts reducibly on V. 


Example 7.5.25. Let V = R?, let W = .Y C R?*? be the space of symmet- 
ric matrices, equipped with the standard inner product and the standard 
action S++ gSg' of SL(2,R), and let w = S := diag(1,—1) € Y so that 


w=i(5 a) 


The action of Gg on R? is reducible, because the diagonal in R? is Gg- 
invariant, however, the function fg(P) = trace(SP~!SP~') attains its min- 
imum on the set Ms = Gg C o(R?), corresponding to the symmetric in- 
ner products on R?. If one modifies this example by taking W = .Y x Y 
and w = (1,9), then f,,(P) = trace(P~? + SP~'SP~*) has a unique critical 
point at P = 1, the group G,, = {1} acts reducibly on V, and the assertions 
of Theorem 7.5.17 are trivially satisfied for every inner product on V. 


a>o,@— =a}. 
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Example 7.5.21 and Example 7.5.25 show that, in general, one cannot 
expect there to be a one-to-one correspondence between the minima of fy, 
and the (p, w)-symmetric inner products on V. However, we shall see below 
that such a one-to-one correspondence does exist in many cases. 


Remark 7.5.26. Once it is known, that the function f,, : Ao(V) > R has 
a critical point Py € A(V), we can modify the entire setup as follows. 
Replace the inner product on V by (-,:)vo := (Poy and the inner 
product on W by (-,-)wo := (, p(P) _):)w- This pair of inner products sat- 
isfies the requirements of Lemma 7.5.19. Let Aoo(V) be the space of self- 
adjoint positive definite endomorphisms with respect to the new inner prod- 
uct and define the function fw: Aoo(V) + R by the analogous formula. 
Then P € A(V) if and only if PP)’ € Ago(V) and fuo(PP) |) = fw(P) 
for all P€ Ap(V). Thus fyo attains its minimum at P = 1. 


In the next corollary we do not assume that G,, acts irreducibly on V. 


Corollary 7.5.27 (Cartan Decomposition). Assume the function fy in 
Lemma 7.5.20 has a critical point at Po = 11 and define 


Ky = Gel isOV ), Py =e du ln =F |. (7.5.40) 
Then the map 
Kw X Pw + Gw : (u, 7) > exp(7)u =: dw(u, 7) (7.5.41) 


is a diffeomorphism. Hence the map Gw > Ao(V): 9 +> Vgg* descends to 
a diffeomorphism from the quotient space Gw/Ky to My = Gun Ao(V). 


Proof. By part (ii) of Lemma 7.5.22 with Py = 1 the function f,, attains its 
minimum on the set My = GwM Yo(g). Now define the map 


Vw : Gw > Ky x Pw 
by tw(g) := (u,7), where 
= 5 exp '(gg*) E pw, u:=exp(—n)g € Kw 


for g € Gy. This map is well defined and smooth because, for every g € Gy, 
we have gg* € GwM Yo(V) = My (see Exercise 7.5.16), and the exponential 
map descends to a diffeomorphism exp : py — My (see Lemma 7.5.22). 
Since dy Oy = id and Wy Oo dy = id, it follows that ¢y is a diffeomorphism. 
This proves Corollary 7.5.27. 
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As a warmup for the main application of Theorem 7.5.17 in §7.6 it may 
be useful to consider the following two examples. 


Exercise 7.5.28. (i) Let V be an m-dimensional real vector space and 
W :=S°V* 


be the vector space of all symmetric bilinear forms Q:V x V > R. De- 
fine the homomorphism p : SL(V) + SL(W) by the standard action of the 
group SL(V) on W, ice. 


(p(9)Q) (v, v') = Q(g7*v, 97 *v’) 


for g € SL(V), Q € S?V*, and v,v' € V. Assume that V is equipped with 
an inner product and an orthonormal basis e1,...,@m, and define an inner 
product on W by (Q,Q’) := ees Q(ei, €;)Q" (ei, e;) for Q, Q’ € S?V*. Show 
that this inner product is independent of the choice of the orthonormal basis 
and satisfies the requirements of Lemma, 7.5.19. 

(ii) The inner product on V is an element Qo € W whose isotropy sub- 
group is the special orthogonal group SO(V). Show that the function fg, 
in (7.5.38) is given by fg,(P) = trace(P?) for P € Ap(V) and that it has a 
unique critical point at P = 1. 


(iii) Examine the case where V = R™*! is equipped with the standard inner 
product and Q € W is the quadratic form in (6.4.9). Relate this example to 


the isometry group of hyperbolic space (§6.4.3). Find a maximal compact 


subgroup of the identity component of O(m, 1). 


Exercise 7.5.29. (i) Let V be a 2n-dimensional real vector space and 
W = A’vV* 


be the vector space of all skew-symmetric bilinear forms 7: V x V > R. 
Define the homomorphism p: SL(V) + SL(W) by the standard action of 
the group SL(V) on W, ice. 
(o(g)r) (v, 0") = (ger) (0, 0") = r(g7*0, 71’) 
for g € SL(V), 7 € A?V*, and v,v’ € V. Assume that V is equipped with 
an inner product and an orthonormal basis e€),...,¢2n, and define an inner 
product on W by (7,7’) := 30; 7 (ei, e;)7'(ei,e;) for 7,7' € A?V*. Show 
that this inner product is independent of the choice of the orthonormal 
basis and satisfies the requirements of Lemma 7.5.19. 
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(ii) Let w: V x V +R be a nondegenerate skew-symmetric bilinear form. 
Then the pair (V,w) is called a symplectic vector space and w is called 
a symplectic form on V. The isotropy subgroup of w in GL(V) is called 
the symplectic linear group. Denote this group and its Lie algebra by 

Sp(V, Ww) {9 € GL(V) | w(9-,9°) = w} ’ 

sp(V,w) := Lie(Sp(V,w)) = {A € End(V) | w(A-,-) + w(-, A-) =O}. 
The group Sp(V,w) is connected and contained in SL(V) (see [49]). An au- 
tomorphism J : V — V is called a linear complex structure iff J? = —1. 
A linear complex structure J is called compatible with w iff the bilin- 
ear form w(-,J-) is an inner product on V. An inner product (-,-) on V 
is called compatible with w, iff there exists a linear complex structure J 
such that w(-, J-) = (-,-). Prove that an inner product on V is compatible 


with w if and only if it satisfies the conditions (for any basis €1,..., €2n of V) 
det (w(e;,e;)) = det((e;, e;)), (7.5.42) 

A € sp(V,w) = A* € sp(V,w). (7.5.43) 

If an inner product on V is compatible with w, prove that g € Sp(V,w) 


implies g* € Sp(V,w) (without using the fact that Sp(V,w) is connected). 
Hint: Define J € End(V) by w(-,J-) := (-,-) and show that J+ J* =0. 
Show that A € sp(V,w) if and only if AJ + JA* =0. Use (7.5.43) to prove 
that J? commutes with every self-adjoint endomorphism of V and hence 
satisfies J? = All for some \ € R. Use (7.5.42) to conclude that 4 = —1. 
(iii) Fix an inner product on V and a symplectic form w:V x V > R that 
satisfies (7.5.42). Then the function f,, in (7.5.38) is given by 


fulP) = >" w(ei,e;)w(Pei, Pes), Pe Po(V), (7.5.44) 
a,j 
where e1,...,€2n is an orthonormal basis of V. Prove that this is the norm 


squared of w with respect to the inner product (-,P~!-). Prove that P is 
a critical point of f,, if and only if the inner product (-,P~!-) is compati- 
ble with w. Prove that 7(V,w) is a Hadamard manifold and a symmetric 
space (Exercise 6.5.23). Prove that, if J € 7(V,w), then the unitary group 
UG) = {g € Sp(V,w) | gJg* = J} is a maximal compact subgroup 
of Sp(V,w) and that every compact subgroup of Sp(V,w) is conjugate to a 
subgroup of U(V,w, J). All this is of course well known, but this exercise 
shows how these results can be derived from Theorem 7.5.17. Moreover, it 
is not necessary to assume that Sp(V,w) is connected. One can start with 
the identity component of Sp(V,w), prove that it is contained in SL(V), and 
deduce the connectivity of Sp(V,w) from that of the unitary group. 
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7.6 Semisimple Lie Algebras* 


This section discusses applications of the results in §7.5.3 to Lie algebra 
theory, following the work of Donaldson [17]. It examines symmetric inner 
products on Lie algebras (§7.6.1), establishes their existence on simple Lie 
algebras (§7.6.2), and derives as consequences several standard results in Lie 
algebra theory, such as the uniqueness of maximal compact subgroups up 
to conjugation for semisimple Lie algebras (§7.6.3), and Cartan’s theorem 


about the compact real form of a semisimple complex Lie algebra (87.6.4). 

Here are some basic definitions that will be used throughout this section. 
Let g be a finite-dimensional real Lie algebra (Definition 2.4.22). A vector 
subspace h C g is called an ideal iff [€,] € b for every € € g and every 7 € b. 
The Lie algebra g is called abelian iff the Lie bracket vanishes. It is called 
simple iff it is not abelian and does not contain any ideal other than h = {0} 
and h = g. Examples of ideals in any Lie algebra g are the center Z(g) 


(Exercise 2.5.33) and the commutant [g, g] (Exercise 5.2.23), defined by 


Z(g) = {€ € g|[&.7] =0 for all n € g}, 

[9,9] -= span {[€,7]|€.7 € g}. 
Recall also the definition of the adjoint representation ad : g > Der(g) in 
Example 2.5.22 by ad(&) := [€,-] for € € g and the definition of the Killing 
form & : g X g > R in Example 5.2.24 by «(&,7) := trace(ad(&)ad(7)) 
for £,7 € g. Let Auto(g) be the connected component of the identity in the 
group Aut(g) of automorphisms of g. This is a Lie subgroup of GL(g) whose 
Lie algebra is the space Lie(Auto(g)) = Der(g) of derivations on g. 


7.6.1 Symmetric Inner Products 


In [17] Donaldson introduced the following notion. 


Definition 7.6.1. Let g be a finite-dimensional real Lie algebra. An inner 
product (-,-) on g is called symmetric iff it satisfies the condition 


6 € Der(g) —— d* € Der(g). (7.6.1) 
Here 6* :g — g denotes the adjoint of the endomorphism 6 with respect to 
the inner product, i.e. it satisfies (€,0*n) = (0&,) for all €,n € g. 


Exercise 7.6.2. Let g be a finite-dimensional real Lie algebra equipped with 
a symmetric inner product. Prove that g € Auto(g) = > g* ©€ Auto(g). 
Hint: See Exercise 7.5.16. 


Some consequences of the existence of a symmetric inner product are 
derived in Lemma 7.6.8 below. To begin with we discuss some examples. 
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Every vector space endomorphism of a Lie algebra g that takes values 
in the center Z(g) and vanishes on the commutant [g,g] is a derivation. 
Conversely, every derivation that takes values in Z(g) necessarily vanishes 
on [g,g]. The space of such derivations is an ideal in Der(g), denoted by 


Derz(g) := {6 € Der(g) | im(5) C Z(g)}. (7.6.2) 


Example 7.6.3. Consider the abelian Lie algebra g = R™”. The adjoint 
representation is trivial and Der(g) = gl(m,R) = R™*"™ is the Lie algebra 
of all vector space endomorphisms of g. Thus Der(g) = Derz(g), the Killing 
form on g vanishes, and every inner product on g is symmetric. 


Example 7.6.4. Consider the Lie algebra g = gl(m,R) with [g, g] = sl(g) 
and Z(g) = RI. It satisfies g = [g, g] © Z(g) and Der(g) = ad(g) © Derz(g), 
where Derz(g) C Der(g) is the one-dimensional subspace generated by the 
derivation 5z(A) = trace(A)1 (whose trace is m). Moreover, the standard 
inner product (A, A’) = trace(A' A’) on g is symmetric and the kernel of the 
Killing form «(A, A’) = 2mtrace(AA’) — 2trace(A)trace(A’) is Z(g). 
Example 7.6.5. Consider the Lie algebra g = gl(m,R) x R™ with the 
Lie bracket [(A, v), (A’,v’)] := ([A, A], Av’! — A’v). This Lie algebra can be 
identified with the space of all affine vector fields on R™. It has a triv- 
ial center and the commutant [g,g] = sl(m,R) x R™ has codimension one. 
Moreover, trace(ad(A, v)) = trace(A) for every (A,v) € g, the kernel of the 
Killing form «((A, v), (A’, v’)) = (2m + 1)trace(AA’) — 2trace(A)trace(A’) is 
the abelian ideal {0} x R™, and the adjoint representation ad : g > Der(g) 
is a Lie algebra isomorphism. 


Example 7.6.6. Consider the Heisenberg algebra h = V x R of a symplectic 
vector space (V,w) with the Lie bracket [(v,t), (v’,t’)] = (0,w(v,v’)) (see 
Exercise 2.5.15). It satisfies Z7(h) = [6,6] = {0} x R and the Killing form 
vanishes. Every derivation on h has the form 6(v,t) = (Av+ Av, A(v) +2At), 
where A € R, A € V*, and A€ sp(V,w). The subspace ad(h) = Derz(h) 
consists of all derivations of the form d(v, t) = (0, A(v)). 


Example 7.6.7. The Heisenberg algebra of a symplectic vector space (V, w) 
extends to a Lie algebra g = sp(V,w) x V x R with the Lie bracket 


(4.0.1), (A0,t)] = (4, 41], Av! — 40, 0(0,04)). 
It satisfies [g, g]) = g and has a one-dimensional center Z(g) = {0} x {0} x R. 


The kernel of the Killing form is the Heisenberg algebra and the adjoint 
representation ad : g + Der(g) is surjective. 


The next lemma shows that the Lie algebras in Examples 7.6.5, 7.6.6, 
and 7.6.7 do not admit symmetric inner products. 
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Lemma 7.6.8. Let g be a finite-dimensional real Lie algebra equipped with 
a symmetric inner product. Then 


Der(g) = ad(g) ®Derz(g), 9 = [g, 9] © 2(g), (7.6.3) 


the kernel of the Killing form &:g x g > R is the center of g, and there 
exists an involution g > g: & ++ &* such that, for all €,n € g, 


ad(é") =ad(€)", —[€, |" = [n", €"]. (7.6.4) 
Proof. Consider the orthogonal decomposition 
Der(g)=A®B, A:=ad(g), B= Ar, (7.6.5) 


with respect to the inner product (6,6’) = trace(d*6’) for 6,6’ € Der(g). 
Since [6, ad(€)] = ad(6&) for 6 € Der(g) and € € g, the subspace A is an ideal 
in Der(g). Moreover, if 6 € B and e € Der(g), then e* € Der(g), hence 
trace([e, ]*ad(€)) = trace(d*[e*, ad(€)]) = trace(d*ad(e*€)) = 0 for all € € g, 
and hence [e, 6] € B. Thus B is also an ideal in Der(g). 

Next define Derz(g)* := {6*|6 € Derz(g)}. We prove that 


B = Derz(g) = Derz(g)*. (7.6.6) 


Since A and B are ideals we have [6, 6’] = 0 for all 6 € B and 6’ € A. Thus 
ad(6é) = [6,ad(€)] = 0 for all 6 € B and € €g, hence im(d) C Z(g) for 
all 6 € B, and so B C Derz(g). Now let 6 € Derz(g)*. Then 6* € Derz(g), 
hence [g, g] C ker(6*), hence trace(d*ad(€)) = 0 for all € € g, and sod € B. 
Thus Derz(g)* C B C Derz(g) and so (7.6.6) holds for dimensional reasons. 
The first equation in (7.6.3) follows directly from (7.6.5) and (7.6.6). It fol- 
lows also from (7.6.6) that trace(d*ad(&)*) = trace(ad(€)d) = 0 for all € € g 
and all 6 € B, and so ad(€)* € B+ = A for all € € g. Thus 


dEA = OE A, (7.6.7) 


By (7.6.7), an element ¢ € g belongs to Z(g) if and only if ad(€)*¢ = 0 for 
all € € g if and only if (¢, ad(€)n) = 0 for all €,7 € g, if and only if ¢ € [g, g|+. 
Thus [g, g]t = Z(g) and this proves the second equation in (7.6.3). 

By (7.6.3), the map ad: g — Der(g) restricts to a Lie algebra isomor- 
phism from [g,g] to A. Hence, by (7.6.7) there exists a unique involution 


(9, 9] > [g, 9] : § + &* that satisfies (7.6.4) for all €,7 € [g,g]. By (7.6.3) this 
involution extends uniquely to an involution g > g: € + &* such that (* =¢ 
for all ¢ € Z(g), and the extended involution satisfies (7.6.4) for all €,7 € g. 

Now let ¢ € g belong to the kernel of the Killing form, ie. K(¢,€) = 0 
for all € € g. Then |ad(¢)|? = «(¢, ¢*) = 0 and hence ¢ € Z(g). Conversely, 
it follows directly from the definitions that Z(g) is contained in the kernel 


of the Killing form and this proves Lemma 7.6.8. 
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7.6.2 Simple Lie Algebras 


The goal of this section is to establish the existence of symmetric inner 
products on simple Lie algebras. First, the following lemma derives some 
immediate consequences of the definition of a simple Lie algebra. 


Lemma 7.6.9. Let g be a finite-dimensional simple real Lie algebra. Then 
the center of g is trivial, the adjoint representation ad : g — Der(g) is 
injective, the commutant is [g, g] = g, and trace(ad(€)) = 0 for all € € g. 


Proof. The center Z(g) is an ideal in g. It is not equal to g because g is not 
abelian, and hence Z(g) = {0} because g is simple. The subspace |g, g] is 
also an ideal in g. It is nonzero because g is not abelian, and hence [g, g] = g 
because g is simple. The adjoint representation ad : g — Der(g) is injec- 
tive because its kernel is the center of g. The last assertion follows from 
the fact that [g,g] = g and trace(ad((€,7])) = trace({ad(€), ad(7)]) = 0 for 
all €,7 € g. This proves Lemma 7.6.9. 


That the Killing form of a simple Lie algebra is nondegenerate is a deeper 
result that does not follow directly from the definition. In [17, Theorem 1] 
Donaldson deduces the existence of symmetric inner products on simple Lie 
algebras from Theorem 7.5.17 and derives as corollaries various standard 
results in Lie algebra theory, including nondegeneracy of the Killing form. 


Theorem 7.6.10. Let g be a finite-dimensional simple real Lie algebra. 
Then the Killing form on g is nondegenerate, the adjoint representation 
ad : g — Der(g) is bijective, every derivation 6: g > g has trace zero, and 
every automorphism in the identity component Auto(g) has determinant one. 


Proof. See page 394. 


In particular, Theorem 7.6.10 establishes for every simple Lie algebra g 
the existence of a connected Lie group Auto(g) C SL(g) whose Lie algebra 
is isomorphic to g. 


Theorem 7.6.11 (Donaldson). Every finite-dimensional simple real Lie 
algebra admits a symmetric inner product. Moreover, if SO(g) is the special 
orthogonal group associated to a symmetric inner product on g, then 


K := Auto(g) M SO(g) (7.6.8) 


is connected and is a maximal compact subgroup of Auto(g), every compact 
subgroup of Auto(g) is conjugate to a Lie subgroup of K, and every maximal 
compact subgroup of Auto(g) is conjugate to K. 


Proof. See page 394. 
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Given an inner product (-,-) on any m-dimensional Lie algebra g and an 
orthonormal basis e€1,...,@m of g, define the function fy : Ao(g) > R by 


AP) => lene], P [Pein Pe) (7.6.9) 
ij 


for P € Yo(g). The right hand side of (7.6.9) is independent of the choice 


of the orthonormal basis and is the norm squared of the Lie bracket with 
respect to the inner product (-,P~!-) on g. 


Theorem 7.6.12 (Donaldson). Let g be a finite-dimensional simple real 
Lie algebra equipped with a symmetric inner product. Then the set 


M, := {P € P(g) | dfy(P) = 0} = {P € Po(g)| fo(P) = inf fy} 
= A(g)N Aut(g) = {exp() | 6 € Der(g), 6 = 6*} (7.6.10) 
= {P € Ag) | the inner product (-,P~'-) is symmetric} 


of critical points of fy is a geodesically convex and totally geodesic subman- 
ifold of Ao(g). Hence it is a Hadamard manifold and a symmetric space. 


Proof. See page 394. 


The proofs require three preparatory lemmas. We do not assume that 
every derivation has trace zero. Thus it is necessary as an intermediate step 
to introduce the subspace Dero(g) := {6 € Der(g) | trace(d) = 0}. We will 
consider inner products on g that satisfy the condition 


6 € Dero(g) —y d* € Dero(g). (7.6.11) 


Lemma 7.6.13. Let g be a finite-dimensional real Lie algebra satisfying 
the conditions Z(g) = {0} and |g, g] = g, and fix an inner product (-,-) ong. 
Then the inner product is symmetric if and only if it satsfies (7.6.11). More- 
over, if such an inner product exists, then Der(g) = ad(g) = Dero(g). 


Proof. Assume (7.6.11) and consider the decomposition Derg(g) = Ao © Bo, 
where Ag := ad(g) C Dero(g) (because [g,g] = g) and Bo := Ag are ideals 
in Dero(g) as in the proof of Lemma 7.6.8. Hence ad(d&) = [6, ad(&)] = 0 for 
all 5 € Bo and all € € g. Since Z(g) = {0}, this implies By = 0, and hence the 
adjoint representation ad : g + Derg(g) is bijective. By (7.6.11), this implies 
the existence of an involution € ++ €* such that ad(é*) = ad(€)* for all € € g. 
Since the adjoint representation ad : g > Der(g) is injective, this in turn im- 
plies that the Killing form is nondegenerate and so Der(g) = ad(g) = Dero(g) 
by Lemma 7.4.3. Thus the inner product on g is symmetric. Conversely, 
if the inner product on g is symmetric, then it satisfies (7.6.11) because 6 
and 6* have the same trace. This proves Lemma 7.6.13. 
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Lemma 7.6.14. Let g be a finite-dimensional simple real Lie algebra with 
a symmetric inner product, and let g > g:& +> &* be the unique involution 
that satisfies (7.6.4). Then there exists a constant c > 0 such that 


K(E",n) =c(E,n) for all €,n € g. (7.6.12) 


Proof. By Lemma 7.6.9 the adjoint representation ad : g —> Der(g) is injec- 
tive. Hence the map g x g > R: (€,7) } «(&*,7) = trace(ad(&)*ad(7)) is 
an inner product on g. Thus there exists a self-adjoint positive definite vec- 
tor space isomorphism A : g — g such that «(€*,7) = (A&,7) for all €,7 € g. 
Let c > 0 be an eigenvalue of A and define h := {n Eg | An = cn} . We prove 
that § is an ideal in g. Let € € g and 7 € h. Then, for all ¢ € g, 


(A[E, 1,6) = K([E,]*, 0) = «(In*, £7], ¢) = w(n*, [€*, ¢]) = (An, [&*, ¢]) 
= (An, ad(§")¢) = (ad(€)An, ¢) = ([€, An], ¢) = e([é, m1, ¢)- 


Hence A[€,7] = c[€,7] and so [€,7] € 6. This shows that is a nonzero ideal 
and hence = g. Thus A = cll and this proves Lemma 7.6.14. 


Given any inner product on g, call an element P € Yo(g) symmetric 
iff the inner product (-,P~!-) on g is symmetric. Thus every symmetric 
element P € #o(g) determines an involution tp : Der(g) — Der(g) given 
by Tp(6) := P6*P~+. Denote its determinant by ep := det(tp) € {-1, +1}. 


Lemma 7.6.15. Let g be a finite-dimensional simple real Lie algebra, choose 
any inner product on g, and let P,Py € PYo(g) be symmetric elements. 
Then PP, ' € Aut(g). 


Proof. The composition of the involutions Tp and Tp, is the Lie algebra au- 
tomorphism TpoTp,(d) = Por) eo = PP,'5PoP~! for 6 € Der(g). 
By Lemma 7.6.8 and Lemma 7.6.9 the very existence of a symmetric inner 
product on g implies that the adjoint representation ad : g — Der(g) is a 
Lie algebra isomorphism. Hence there exists a g € Aut(g) such that 


ad(gé) = PPy'tad(€)PoP~' for all Eg. (7.6.13) 
Since ad(g€) = gad(€)g~', this automorphism satisfies the equations 
g PP, E.nl=(E,g "PP 'n] for all €,n € 9, (7.6.14) 


det(g) = epern, € {-1, +1}. (7.6.15) 


394 CHAPTER 7. TOPICS INGEOMETRY 


Since Po is symmetric, it follows from Lemma 7.6.8 and Lemma 7.6.14 that 
there exists an involution g > g: € + &* and a constant c > 0 such that 


ad(é*) = Pyad(€)*Pp*, —-&(€*, 9) = e(€, Py *n) (7.6.16) 
for all €,7 € g. By (7.6.13) and (7.6.16) we have 


n((g€)",9) = trace(Poad(gé)* Py ‘ad(n)) 

= trace((PPj ‘ad(é)PoP~')* Pj 'ad(n) Po) 

= trace(P~! Poad(£)* Po 'PPy‘ad(n) Po) 

= trace(ad(£*) PP, 'ad(n)PoP~*) 

= trace(ad(é*)ad(gn)) 

= «(£*, gn). 
This shows that g is self-adjoint and positive definite with respect to the 
inner product (-, P)':), and so det(g) =1 by (7.6.15). Since PP,’ is self- 
adjoint and positive definite with respect to the same inner product, the 
vector space isomorphism g~!PP, Lg V2(g a ad a 1971/2) 1/2 has only 
positive real eigenvalues. Let A > 0 be one such eigenvalue. Then the 
eigenspace ker(All — g~! PP, ‘) is a nonzero ideal in g by (7.6.14), and so 
is equal to g, because g is simple. Thus go Pr, = All, and since g, P, Po 
all have determinant one, it follows that \ = 1 and so PP; | = g € Aut(g). 
This proves Lemma 7.6.15. 


Proof of Theorems 7.6.10, 7.6.11, and 7.6.12. We use the results of §7.5.3 
in the situation where V := g is the Lie algebra itself and W := A’g* @ g is 
the space of all skew-symmetric bilinear maps 7 : g x g > g. The Lie group 
homomorphism p: SL(g) + SL(W) is given by the standard action of the 
group SL(g) on W, ice. 


(p(g)T)(E,n) = gt (9 *E, 9°) 
for g € SL(g), 7 € W, and €,7 € g. Fix an inner product (-,-) on g and an 


orthonormal basis e1,...,@m of g, and define 
(0,7)w = >_(o(ei, e;), T (ei, €;)) (7.6.17) 
1,J 


for 0,7 € W. This inner product satisfies the requirements of Lemma 7.5.19, 
i.e. p(A*) = p(A)* for all A € sl(g). The vector w := [-,-] € W is chosen 
to be the Lie bracket. This vector is nonzero because g is not abelian. The 
isotropy subgroup of w is the group Aut(g)MSL(g) of all automorphisms of 
determinant one. Denote its identity component by G C Auto(g) MN SL(g). 
This is a Lie subgroup of SL(g) with the Lie algebra Lie(G) = Dero(g). 
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Claim 1. G acts irreducibly on g. 


Let h C g be asubspace that is invariant under the action of G. Then 6h Cc h 
for every 6 € Dero(g). By Lemma 7.6.9 this implies ad(€)h C 6 for all € € g, 
so § is an ideal. Thus h = {0} or h = g because g is simple. 


Claim 2. f, is conver and G-invariant and has a critical point. Moreover, 
every critical point P € Ao(g) is symmetric, and Dero(g) = Der(g). 


In the present setting the function f,, in Lemma 7.5.20 agrees with the 
function f, in (7.6.9) and G,, = G. Hence f, is convex and G-invariant by 
Lemma 7.5.22, and G acts irreducibly on g by Claim 1. Thus Lemma 7.5.23 
asserts that fg has a critical point. Let P € Ao(g) be a critical point of fo. 
Then by Lemma 7.5.23 the inner product (-, P~1-) satisfies (7.6.11) and so, 
by Lemma 7.6.13, this inner product is symmetric and Der(g) = Dero(g). 


Claim 3. Fiz a critical point Po € Po(g) of fy and any element P € Po(g). 
Then the following are equivalent. 


(a) P is a critical point of fy. 

(b) f4(P) = inf fo. 

(c) PP) € Aut(g). 

(d) There exists a 6 € Der(g) such that 6 = Pyd*P)', exp(6) = PP,". 
(e) The inner product (-,P~'-) on g is symmeric. 


Since fg is convex by Claim 2, the equivalence of (a) and (b) follows from 
Lemma 7.5.9. Since Der(g) = Dero(g) by Claim 2, the equivalence of (b), 
(c), and (d) follows from part (ii) of Lemma 7.5.22. Moreover, (a) implies (e) 
by Claim 2, and (e) implies (c) by Lemma 7.6.15. This proves Claim 3. 


The existence of a symmetric inner product on g was proved in Claim 2. 
Thus the nondegeneracy of the Killing form follows from Lemma 7.6.8. 
The remaining assertions of Theorem 7.6.10 are direct consequences of the 
nondegeneracy of the Killing form. In particular, the adjoint represen- 
tation ad: g > Der(g) is bijective and Der(g) C sl(g) by Lemma 7.4.3. 
Hence Auto(g) C SL(g) and so G,, = G = Auto(g) in the notation of §7.5.3. 

Now fix a symmetric inner product on g. Then Po = 1 is a critical point 
of fg by “(e) => (a)” in Claim 3. Hence the assertions about the sub- 
group K = Auto(g) MSO(g) in Theorem 7.6.11 follow from Lemma 7.5.23. 
The equalities in (7.6.10) follow from the equivalence of (a), (b), (c), (d), (e) 
in Claim 3 with Py = 1, and the remaining assertions about the space M, 
in Theorem 7.6.12 follow from Lemma 7.5.24. This completes the proof of 
Theorems 7.6.10, 7.6.11, and 7.6.12. 
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7.6.3 Semisimple Lie Algebras 


The following theorem characterizes semisimple Lie algebras in terms of 
symmetric inner products. First observe that, if h is an ideal in a finite- 
dimensional real Lie algebra g, then the subspace 


b’ := {n' € g| K(n, 7) =0 for all n € 5} (7.6.18) 


is also an ideal, because every 7’ € h’ satisfies K(n, |€,17']) = K([n, €], 7’) = 0 
for all € € g and all 7 € b, and so [€,7'] € h’ for all € Eg. 


Theorem 7.6.16 (Semisimple Lie algebras). Let g be a finite-dimen- 
sional real Lie algebra. Then the following are equivalent. 


(i) g has a trivial center and admits a symmetric inner product. 
(ii) The Killing form &: g x g > R is nondegenerate. 

(iii) If h C g is an ideal, theng=6h OO’. 

(iv) g is a direct sum of simple ideals. 


(v) There exists an inner product (-,-) on g, an involution g > 9:&H &, 
and a constant c > 0 such that, for all €,n € g, 


ad(€*) = ad(€)*, [E.m)° = [ne], K(€*,n) =e(€,n). (7.6.19) 


Definition 7.6.17. A real Lie algebra g is called semisimple iff it is finite- 
dimensional and satisfies the equivalent conditions in Theorem 7.6.16. 


Proof of Theorem 7.6.16. That (i) implies (ii) was shown in Lemma 7.6.8. 
We prove that (ii) is equivalent to (iii). Assume first that the Killing 
form is nondegenerate and let  C g be an ideal. Then 


yeh: wey = [n, n'] = 0, (7.6.20) 


because «((7, 7], €) = «(n, [7’, €]) = 0 for all € € g, all 7 € b, and all 7 € by’. 
Now let 7 € 6M’. Then 7’ := ad(€)ad(n)¢ = [E€, [7, ¢]] € 6’ for all €,¢ € g. 
Hence, by (7.6.20) we have (ad(€)ad(n))?¢ = [€, [n, 7’]] = 0 for all €,¢ € g. 
Thus (ad(€)ad(7))? = 0 and so «(€,7) = trace(ad(€)ad(7)) = 0 for all € € g. 
This implies 7 = 0 by nondegeneracy of the Killing form. Thus Mb’ = {0} 
and so g =) @h’ because dim(h) + dim(h’) = dim(g). This shows that (ii) 
implies (iii). To prove the converse, take h = g so that h’ = {0} is the kernel 
of the Killing form. 

It follows from (ii) and (iii) that, for every ideal h C g, the Killing forms 
of h and 6’ are both nondegenerate. Hence an induction argument shows 
that (iii) implies (iv). 
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We prove that (iv) implies (v). Assume that g = 91 ®---@g, is a di- 
rect sum of simple ideals g; C g. Fix an index j € {1,...,r} and denote 
by &j : 9; X gj; — R the Killing form of g;. By Theorem 7.6.11 the Lie alge- 
bra g; admits a symmetric inner product (-,-);.. Hence, by Lemma 7.6.8 
there exists an involution g; > g;:€++ €* that satisfies (7.6.4), and by 
Lemma 7.6.14 there exists a constant cj > 0 such that «;(€*,7) = cj (€, 1); 
for all €,7 € gj. Thus the inner product (€,7) := )7; ¢j(&j,j)5 for €), my € 9; 
and €=&4+---+&,7 =m +-::+7, satisfies the requirements of part (v) 
with ¢= 1 and €* =f) e+ €. 

If (v) holds, then the Killing form is nondegenerate, hence Z(g) = 0 
and Der(g) = ad(g) by Lemma 7.4.3, and hence the inner product in (v) is 
symmetric. Thus (v) implies (i) and this proves Theorem 7.6.16. 


Corollary 7.6.18 (Cartan Involution). Let g be a semisimple real Lie 
algebra equipped with a symmetric inner product and let € + &* be the invo- 
lution in Lemma 7.6.8. Then the map 


ae ee eae (7.6.21) 


is a Lie algebra homomorphism (called a Cartan involution). The Cartan 
involution (7.6.21) gives rise to a splitting 


g=€Op, §:={€eglE+E°=0}, pr={negln=n"}, (7.6.22) 
such that 
em 2 [é, p] C Pp, [pp] C &. (7.6.23) 


Moreover, € is nontrivial, the Killing form &:g x g > R is negative definite 
on € and positive definite on p, and K(€,7) = 0 for all € € € and all n € p. 


Proof. That the map (7.6.21) is a Lie algebra homomorphism follows di- 
rectly from (7.6.4). It follows also from (7.6.4) that the subspaces t,p C g 
in (7.6.22) satisfy (7.6.23). That g is the direct sum of these subspaces, 
follows from the identity ¢** = ¢ for ¢ € g, which implies 


C=E+n, E:=5(C-C)et n= 5(C +0) ep. 


By (7.6.23) the summand £ must be nontrivial, because g is not abelian. 
Next observe that the formula (A,B) := trace(A*B) defines an inner 


product on End(g) with the norm |A| := ,/trace(A*A), and that 
K(E,7") = K(&*,n) = trace(ad(€)*ad(7)) = (ad(€), ad()) . 
For € € ¢ and 7) € p this implies «(€,€) = —|ad(€)|?, «(n,7) = |ad(n)|?, 


and «(€,7) = 0. This proves Corollary 7.6.18. 
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Lemma 7.6.19. Let g be a semisimple real Lie algebra equipped with a 
symmetric inner product, let € ++ €* be the involution in Lemma 7.6.8, 
let g = gi B--- Pg, be a decomposition into simple ideals g;, and let €,p C g 
be as in Corollary 7.6.18. Then the following holds. 


(i) The simple ideals g; C g are pairwise orthogonal. 
(ii) Each ideal g; is invariant under the involution € > &. 
(iii) p is the orthogonal complement of €. 


(iv) For each j the restriction of the inner product to g; is symmetric 
and g; =*; Op;, where §; =€Ng;, pj =pNgj are as in Corollary 7.6.18. 


Proof. We prove part (i). For each i the orthogonal complement g} is an 
ideal, because ([£,7],¢) = (n, [€*,¢]) =0 for all €€ g, n € gt, ¢ € gi, and 
so [€,n] € gt for all € € g, 7 € gt. This implies that 6; := Nis; g; is an 
ideal, and so is the subspace hj gj. So either h; 1g; = {0} or bj = gj, 
because g; is simple. If h; 1g; ={O}, then [€,7] =O for all € € g; and 
all 7 € 6;, hence g; C Z(g), and this is impossible because the center of g is 
trivial. Thus 5; = g; for all j and this proves (i). 

We prove part (ii). The subspace gj := {7*|7 € gj} is an ideal, be- 
cause [€,7*] = [n,€*]* € gj for all €€g and all 7 €g;. By part (i) we 
have ([€,7*],¢) = —(€,[n,¢]) =0 for all €€ g; and 7 € g; with i# J, and 
all ¢ € g. Hence (gi, 95] = (0 for i#j. Hence the ideal gj gj; cannot be 
zero, because otherwise [g;,g;] = 0 and so gj C Z(g). Hence gj = gj, 
because gj; is simple. This proves (ii). 

We prove part (iii). By (ii) and Lemma 7.6.14 the involution € + &* 
preserves the inner product on gj, and so by (i) it preserves the inner product 
on all of g. Hence (€,7) = (€*,*) = —(€,n) for all€ € fandalln € p. Thust 
and p are orthogonal to each other and this proves (iii). 

Part (iv) follows directly from (ii) and this proves Lemma 7.6.19. 


Theorem 7.6.20. Let g be an m-dimensional real Lie algebra that is not 
abelian and fix an inner product on g and an orthonormal basis e1,...,€m 
of g. Then the following are equivalent. 
(i) P= 11 is a critical point of fy. 
(ii) There exists a real number c such that 
m 
Yd (2ad(ei)*aa(er) — ad(e;)ad(ci)*) = <ll, (7.6.24) 
i=1 
(iii) g is semisimple, the inner product is symmetric, and there exists an 
involution g > g:&€ ++ &* and a constant c > 0 such that (7.6.19) holds. 
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Proof. By Lemma 7.5.20 the element P = 1 € Yo(g) is a critical point of 
the function f, in (7.6.9) if and only if, for all A € sl(g), 


O= >> (lei, es], Ales, eg] — [Aes, es] — les, Aes]) 


I 


trace (ad(e:)* Aad (ey) — 2ad(e;)*ad(e;)A) ; 


This holds if and only if there exists a constant c € R that satisfies (7.6.24). 
Thus we have proved that (i) is equivalent to (ii). 

We prove that (i) and (ii) imply (iii). Since P = lis a critical point of fy, 
Lemma 7.5.20 asserts that the inner product on g is symmetric. Thus by 
Lemma 7.6.8 there exists an involution g + g: € ++ &* that satisfies (7.6.4). 
Hence by (ii) there exists a real number c such that — 


m 
Oei= Y (2ad(e})ad(er) - ad(e;)ad(c})) = ell, (7.6.25) 
i=1 
Since g is not abelian, the endomorphism Q, has a positive trace, so c > 0. 
This implies that the center of g is trivial, because Z(g) C ker(Qg). Hence, 
by Lemma 7.6.8, the Killing form on g is nondegenerate. By Lemma 7.6.19 
this implies that the decomposition g = € @ p in Corollary 7.6.18 is or- 
thogonal. Hence the orthonormal basis e1,...,€m of g can be chosen such 
that e1,...,e% is a basis of € and e%41,...,€m is a basis of p. Thus e7 = —e; 
for i < k and e* = e; for i > k, and so it follows from (7.6.25) that 


Yaale Jad(e;) = cll. (7.6.26) 


Hence x(E*,n) = Doy(ei, ad(E*)ad(n)es) = Y,(€,ad(ef)ad(e;)n) = c(€,n) for 
all €,7 € g. This proves (iii). 

That (iii) implies (ii) follows by reversing this argument. By (iii) the 
Killing form is nondegenerate and the inner product is symmetric and is 
preserved by the involution € +> €*. Thus the splitting g = € @ p is orthog- 


onal, and hence the orthonormal basis e1,...,@m of g can be chosen such 
that e1,...,e% is a basis of € and ex44,...,@€m is a basis of p. Moreover, 
YS" (&, ad(e7)ad(ei)n) = (es, ad(E*)ad(n)es) = K(€*, n) = e(€,n) 
i=l i=1 


for all £,7 € g by (7.6.19). This implies (7.6.26). Since e* = +e; for all 4, 
equation (7.6.24) follows from (7.6.26). This proves Theorem 7.6.20. 
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Corollary 7.6.21 (Cartan Decomposition). Let g be a semisimple real 
Lie algebra equipped with a symmetric inner product, let g > 9: € ++ &* be 
the involution in Lemma 7.6.8, and define 


K := Auto(g) MSO(g), Dert (g) := {6 € Der(g)|6 = 6*}. 


Then the following holds. 

(i) K ts connected and is a maximal compact subgroup of Auto(g), every 
compact subgroup of Auto(g) is conjugate in Auto(g) to a Lie subgroup of K, 
and every maximal compact subgroup of Auto(g) is conjugate to K. 


(ii) The map 
K x Der*(g) > Auto(g) : (u, 6) + exp(d)u 


is a diffeomorphism. 
(iii) If there exists ac >0 such that K(€*,n) = c(E,n) for all €,n € g, then 


Mg := Crit (fy) = Po(g) N Aut(g) = {exp(d) | 5 € Der* (g)} 


is a totally geodesic and geodesically convex submanifold of Ao(g) and so is 
a Hadamard manifold and a symmetric space, and the map 


Auto(g) > Yo(g): 9 4 Vag" 
descends to a diffeomorphism from the quotient space Auto(g)/K to Mg. 


Proof. By Theorem 7.6.16 there exists a splitting g=g91 ®---@gr, into 
simple ideals, this splitting is preserved by every derivation of g, and by 
Lemma 7.6.19 it is also preserved by the involution € ++ €*. Thus Auto(g) is 
isomorphic to the product of the groups Auto(g;), and K = Auto(g) 9 SO(g) 
is isomorphic to the product of the subgroups K; := Autgo(g;) A SO(g;). 
Hence part (i) follows from Theorem 7.6.11. Moreover, by Lemma 7.6.19, 
we have p = p; ®--- @p,, and so part (ii) follows from Corollary 7.5.27. 
Under the assumptions of (iii) Theorem 7.6.20 asserts that Po = lisa 
critical point of fy, so (iii) follows from Lemma 7.5.22, Lemma 7.5.24, and 
Corollary 7.5.27. This proves Corollary 7.6.21. 


Remark 7.6.22. The Lie algebra of the group K = Auto(g) M SO(g) in 
Corollary 7.6.21 is given by Lie(K) = {ad(&) | € € €} (see Corollary 7.6.18). 
If the summand p in (7.6.22) is trivial, then Autg(g) = K is a compact Lie 
group. If p is nontrivial, then the quotient space Auto(g)/K is a nontrivial 


Hadamard manifold diffeomorphic to p. 
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Remark 7.6.23. One can replace the space Yo(g) of positive definite self- 
adjoint vector space isomorphisms P : g — g of determinant one by the 
space #4 of inner products on g with a fixed determinant (Remark 6.5.10). 
This eliminates the dependence on the background inner product and there 
is then only one function fy : H% — R whose set of minima is the totally 
geodesic submanifold .4, C AG of all symmetric inner products on g with a 
fixed determinant that satisfy part (iii) of Theorem 7.6.20. The main result 
asserts that, when g is not abelian, the space .4, nonempty if and only if g 
is semisimple (Theorem 7.6.16 and Theorem 7.6.20). 


7.6.4 Complex Lie Algebras 


A complex Lie algebra is a complex vector space g equipped with a Lie 
bracket g x g > g: (€,7) +> [€,7] that is complex bilinear, i.e. it is a skew- 
symmetric bilinear map that satisfies the Jacobi identity and 


lig, n] = [€, in] = i[€, n] 


for all €,7 € g. Thus every complex Lie algebra is also a real Lie algebra. 
Let g be a finite-dimensional complex Lie algebra. A complex ideal in g 
is a complex linear subspace § C g that satisfies [€,7] € 6 for all € € g and 
all 7 € h. The complex Lie algebra g is called simple iff it is not abelian and 
has no complex ideals other than h = {0} and h = g. It is called semisimple 
iff it is finite-dimensional and the complex Killing form K°:g x g—>C, 
defined by K°(€,7) := trace©(ad(&)ad()) for €,7 € g, is nondegenerate. 
Since « = 2Rexk‘, a complex Lie algebra is semisimple if and only if it is 
semisimple as a real Lie algebra. The next lemma shows that the analogous 
assertion holds for simple complex Lie algebras (see [17, Lemma 7]). 


Lemma 7.6.24. A finite-dimensional complex Lie algebra g is simple if and 
only if it is simple as a real Lie algebra. 


Proof. Let g be a simple complex Lie algebra of complex dimension n and 
let h C g be a real linear subspace of g that satisfies [€,7] € h for all € € g 
and all 7 € §. Then the subspaces 


bib, h + ib 
are complex ideals in g, and their real dimensions satisfy the equation 
dim™(h N ih) + dim™(h + ih) = 2dim™(h). 


Since both summands on the left are either 0 or 2n, the real dimension of h 
is either 0, n, or 2n. We claim that the dimension cannot be n. 
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Assume, by contradiction, that dim™(h) = n. Then 
bib = {0}, b+ib=g. 
Hence, for all ¢,¢’ € g there exist €,7 € such that ¢ = € + in, and so 
[6,6] = [€,0'] + In, i’) = i([, -i¢’] + [n, 1) € 6. ib = {0}. 


This contradicts the fact that g is not abelian. Thus h = {0} or h = g, and 
this proves Lemma 7.6.24. 


Lemma 7.6.25. Let g be a semisimple complex Lie algebra. Then g is a 
direct sum of simple complex ideals. 


Proof. Let § Cg be a real ideal and let h’ C g be as in (7.6.18). Then 
it follows from “(ii) == > (iii)” in Theorem 7.6.16 that [h + ib, h’] = {0}. 
Hence h + if Cc h” = h and so ih = h. Thus every real ideal in g is a complex 
ideal. Hence the assertion follows from “(ii) => (iv)” in Theorem 7.6.16. 


The next result is a theorem of Cartan [13] which asserts that every 
semisimple complex Lie algebra has a compact real form. The proof given 
here is due to Donaldson [17, Lemma 8]. 


Theorem 7.6.26 (Cartan). Let g be a semisimple complex Lie algebra 
equipped with a symmetric inner product. Then 


p=it, g=toit, 7.6.27) 


( 
where €,p are as in Corollary 7.6.18. Moreover, the group pa ) is the 
complexification of the maximal compact subgroup K = Auto(g) A SO(g), 2 

( 


Auto(g) = {exp(id)u| u € K, 6 € Lie(K)} 7.6.28) 
and the map K x Lie(K) > Auto(g) : (u,d) + exp(id)u is a diffeomorphism. 


Proof. Assume first that g is simple. Then g is simple as a real Lie algebra 
(Lemma 7.6.24), and so has a nondegenerate Killing form (Theorem 7.6.10). 
By the inclusions in (7.6.23) the subspace 


h := (€N ip) + (pn it) = (€ NM ip) + i(€/N ip) 


is a complex ideal in g. Hence it is either {0} or g. If h = g, then it follows 
from (7.6.22) that p = it. Assume, by contradiction, that h = {0}. Then 


eNnip={0}, g=t@ip. 
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Hence the map o : g > g, defined by 


a(€ + in) = €—in 


for € € € and 7 € p is a Lie algebra homomorphism and an involution. This 
implies ad(a(¢)) = aad(¢)o and so K(a(¢),0(¢’)) = K(C, ¢’) for all ¢,¢' € g. 
Hence «(€,i7) = 0 for all € € € and all 7 © p. Thus ip is the orthogonal 
complement of € with respect to the Killing form. Thus, by Corollary 7.6.18, 


p = ip. 
However, for all 7 € p we have «(in,in) = —K(n,n) = —|ad(n)|?. Hence 
the Killing form is negative definite on ip and positive definite on p, and 
hence p = {0}. This implies € = it. Since the Killing form is positive definite 
on i€ and negative definite on €, this is a contradiction. This contradiction 
shows that our assumption h = {0} must have been wrong. Thus h = g and 
hence p = it. This completes the proof of (7.6.27) in the simple case. 
For general semisimple complex Lie algebras, the proof of the equations 
in (7.6.27) reduces to the simple case by Lemma 7.6.19 and Lemma 7.6.25. 
Equation (7.6.28) follows directly from (7.6.27) and Corollary 7.6.21. This 
proves Theorem 7.6.26. 


Remark 7.6.27. Let g be a semisimple complex Lie algebra. Then Aut(g) 
is a complex Lie group, i.e. it admits the structure of a complex manifold 
such that the structure maps 


GxGoG:(h,g) HK hg, CSGigug 


are holomorphic. The proof uses the fact that Der(g) is isomorphic to g and 
that the resulting almost complex structure on Aut(g) is integrable (as it is 
preserved by the torsion-free connection g~!Vg = 4 (9-19) + [g-1g, g-1g]). 
Theorem 7.6.26 asserts that the identity component G = Auto(g) of the 
group of automorphisms of g is the complexification of the maximal com- 
pact subgroup K = Auto(g) MSO(g), ie. its Lie algebra Der(g) = ad(g) 
is the complexification of the Lie algebra Lie(K) = ad(€) and the quo- 
tient space G/K is contractible. These conditions imply the universality 
property that every Lie group homomorphism p: K > Y with values in a 
complex Lie group ¥Y extends to a unique holomorphic Lie group homo- 
morphism p°:G—¥Y such that p°|k = p. Such a complexification exists 
for every compact Lie group K, whether or not it is semisimple. (For an 
exposition see [20, Appendix B].) 
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The following exercise is inspired by a remark in [17, §5.1] concerning a 
positive curvature manifold that is dual to Mg. 


Exercise 7.6.28. Let g be a semisimple real Lie algebra equipped with 
a symmetric inner product that satisfies condition (iii) in Theorem 7.6.20. 
Consider the complexified Lie algebra 


g :=g9 Gig 
with the Lie bracket 
[6,67] == [6,1] — fn. n'] +i, 0'] + [n, €1) (7.6.29) 
and the Hermitian form 
(6,07)° = (EE) + (nn!) + i((E,0') — (0, €)) (7.6.30) 


for ¢=&€+in € g° and (' = €' + in! € g®. With this convention the Hermi- 
tian form (7.6.30) is complex anti-linear in the first variable and complex 
linear in the second variable. Prove the following. 


(a) g° is semisimple. Hint: g° has a trivial center and the real part 
of (7.6.30) is a symmetric inner product on g°. 


(b) If g is simple, then g° is simple. Hint: If 5° is a complex ideal in g°, 
then the linear subspace 


h = {Re(¢)|¢ € b°} 


is an ideal in g. 
(c) Every real linear derivation on g° is complex linear and has complex 
trace zero. 


(d) The identity component Auto(g°) of the group of real linear Lie algebra 
automorphisms of g° consists of complex linear automorphisms of complex 
determinant one. (Complex conjugation is a Lie algebra automorphism of g° 
not in the identity component.) 


(e) The subgroup 
K® := Auto(g°) 1 SU(g*) 


is connected and is a maximal compact subgroup of Auto(g°). Its Lie algebra 
Lie(K°) & € + ip 


is the compact real form of g° (Corollary 7.6.18 and Theorem 7.6.26). 
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(f) Let 6 + iW : g > g° bea Lie algebra homomorphism, i.e. for all €,7 € g, 


GE, 7] = (PE, On] —[VE, Un], —- WIE, n| = [@E, Un] + [VE, On]. 


Assume ® + iW is injective and denote its image by 


(:= {BE + iWE|E € gh. 


Then the following are equivalent. 
(I) The real part of (7.6.30) restricts to a symmetric inner product on lI. 
(II) ®*6 + U*W © Aut(g). 


(III) is a Lagrangian subspace of g° with respect to the imaginary part 
of (7.6.30), ie. O*U — W*S = 0. 


Hint: If ®*® + W*W © Aut(g), then there exists a derivation a € Der(g) 
such that a = a* and exp(2a) = ®*@ + W*W (Corollary 7.6.21). Prove that 


6 := exp(—a)(®*V — U*®) exp(—a) 
is a derivation whose image is abelian. Prove that 
K(d€, 67) = 0 


for all €,7 € g and deduce that 6*6 = —d? = 0. 


(g) The space of oriented Lagrangian Lie subalgebras [ C g° isomorphic to g 
(that can be joined to g by a path of such subspaces) is diffeomorphic to the 
quotient space 

dig 3= KK" KK, 


where K° := Auto(g°) N SU(g°) and K := Auto(g) N SO(g). Hint: Choose 
the embedding ® + iW in (f) such that 


Oh + U*U = I, eV —W*d=0, 


and extend it to a unitary automorphism of g°. 


(h) The space Lg in part (g) embeds as a totally geodesic submanifold 
of dimension dim(L,) = dim(p) into the symmetric space U(g°)/SO(g) of 
all oriented Lagrangian subspaces of g°. Hence Lg has nonnegative sectional 
curvature. (Hint: Example 5.2.17.) One can think of the positive curvature 
manifold Ly of all Lagrangian Lie subalgebras of g° that are isomorphic 
to g as dual to the negative curvature manifold M, of all symmetric inner 
products on g that satisfy condition (iii) in Theorem 7.6.20. 
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Remark 7.6.29. The idea of minimizing the norm of the Lie bracket was 
the approach to the existence of a compact real form of a complex Lie algebra 
suggested by Cartan [14] and carried out by Richardson [58]. One significant 
difference in the method of Donaldson [17], which we follow in the section, 
is that there is no need to assume that the Killing form is nondegenerate, 
but that this result emerges as a byproduct of the proof (Theorem 7.6.10). 
There is also no need to use the structure theory of Lie algebras as in the 
work of Weyl [77]. Instead one can use the existence of a symmetric inner 
product as a starting point to develop the structure theory of Lie algebras. 


Remark 7.6.30. As pointed out by Donaldson [17], a more direct approach 
to Theorem 7.6.26 would be to carry over the entire program in the present 
section and §7.5 to the complex setting, starting in §7.5.2 with convex func- 
tions on the Hadamard manifold M = 2o(V) = SL(V)/SU(V) of positive 
definite Hermitian automorphisms with determinant one of a complex vector 
space V equipped with a Hermitian inner product (Remark 6.5.19). 

In the complex Lie algebra setting with Qo(g) = SL(g, C)/SU(g, C) the 
logarithm of the function fy : Zo(g) > R is the log-norm function of Kempf 
and Ness in geometric invariant theory [38, 20]. Thus the existence of a 
critical point of fg is the polystability condition in GIT. This approach was 
developed by Lauret [44] and he proved that the polystable points are pre- 
cisely the semisimple Lie algebras (Theorem 7.6.20). His proof uses Cartan’s 
theorem about the compact real form of a semisimple complex Lie algebra. 

One can also deduce the theorems in the real setting from those in the 
complex setting by complexifying the relevant real inner product space V 
to obtain a complex vector space V° = V @iV with a Hermitian inner 
product, and embedding the space A(V) ~ SL(V)/SO(V) as a totally 
geodesic submanifold into Qo(V°) = SL(V*°)/SU(V*). 


Appendix A 


Notes 


A.1 Maps and Functions 


The notation 
fi: x7 Y 


means that f is a function which assigns to every point x in the set X a 
point f(a) in the set Y. When Y = R we express this by saying that f is 
a real valued function defined on the set X and, if Y is a vector space, we 
may say that f is a vector valued function. However in general it is better 
to say that f is a map from X to Y and call the set X the source of the 
map and the set Y its target. The graph of f is the set 


graph(f) = {(v,y) €X x Y|y= f(z)}. 


We always distinguish two maps with the same graph when their targets are 
different. 
A map f:X > Y is said to be 


injective f(a) =f eo) Say Hae 
surjective iff Vye Yare X st. y= f(x) 
bijective it is both injective and surjective. 


Then 

(a) f is injective <=> it has a left inverse g: Y > X (ie. go f = idx); 
(b) f is surjective <=> it has a right inverse g: Y > X (ie. fog = idy); 
(c) f is bijective <== it has a two sided inverse f-': Y > X. 


(Item (b) is the Axiom of Choice.) 
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The analogous principle holds for linear maps: if A € R™*”, then the 
linear map R” > R™ : 2+ Az is 


(a) injective <= BA=1, for some B € R"*”; 
(b) surjective — > AB=1,, for some B € R"*”; 
(c) bijective <= A is invertible (ie. m =n and det(A) 4 0). 


(Here I, is the k x k identity matrix.) However, this principle fails com- 
pletely for continuous maps: the map f : [0,27) > S! defined by f(@) = 
(cos 6,sin@) is continuous and bijective but its inverse is not continuous. 
(Here S! C R? is the unit circle x? + y? = 1.) 


A.2 Normal Forms 


The Fundamental Idea of Differential Calculus is that near a point 79 € U 
a smooth map f : U > V behaves like its linear approximation, i.e. 


f(x) © f(ao) + df (xo) (a — 0). 


The Normal Form Theorem from Linear Algebra says that if A € R™*” has 
rank r, then there are invertible matrices P € R™*’™ and Q € R”*” such 


that 
P-'AQ = ( 1, O-x(n—r) ) : 
O(m—r) xr O(m—r)x(n—r) 


By the Fundamental Idea we can expect an analogous theorem for smooth 
maps. 


Theorem A.2.1 (Local Normal Form for Smooth Maps). Let U Cc R" 
and V C R™ be open, x9 € U, and f: U + V be smooth. Assume that the 
derivative df(xo) € R™*" has rank r. Then there is an open neighborhood 
Uo of xo in U, an open neighborhood Vo of f(xo) in V, a diffeomorphism 
ob: Uy x Ug CR” x R"”, a diffeomorphism w : Vo > Uy x V2 C R" x R™”, 
such that $(xo) = (0,0), w(f(xo)) = (0,0), and 


yo fod(a,y) =(x,g9(@,y)) and — dg(0,0) =0 
for (x,y) € Uy x U2. 


The Local Normal Form Theorem is an easy consequence of the Inverse 
Function Theorem. 
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Theorem A.2.2 (Inverse Function Theorem). Let U C R", V C R™, 
zo €U and f :U > V be a smooth map. If df (xo) is invertible, then (m =n 
and) there are neighborhoods Up of xo inU and Vo of f(x) in V so that the 
restriction fiy, : Uo + Vo ts a diffeomorphism. 


Here follow some other consequences of the Inverse Function Theorem. 
The terms submersion and immersion are defined in 82.6.1 and Defini- 
tion 2.3.2 of §2.3. 


Corollary A.2.3 (Submersion Theorem). When r = m the diffeomor- 
phisms ¢ and w in Theorem A.2.1 may be chosen so that the local normal 
form is 


pio fodg(a,y) =a. 


Corollary A.2.4 (Immersion Theorem). When r = n the diffeomor- 
phisms 6 and w in Theorem A.2.1 may be chosen so that the local normal 
form is 


v*o fo g(z) = (2,0). 


Corollary A.2.5 (Rank Theorem). [f the rank of df(x) =r for allx € U, 
then for every xo € U the diffeomorphisms ¢ and W in Theorem A.2.1 may 
be chosen so that the local normal form is 


w*o fo g(a) = (2,0). 


Corollary A.2.6 (Implicit Function Theorem). Let U Cc R™ x R” be 
an open set, let F: U +R” be smooth, and let (xp, yo) € U with xo € R™ 
and yo € R". Define the partial derivative d2F (xo, yo) € R"*” by 


d 
dF (xo, yo)v = dt F (x0, Yo + tv) 
t t=0 


forv ER”. Assume that F(x, yo) =0 and that d2F (xo, yo) is invertible. 
Then there exist neighborhoods Up of x9 in R™ and Vo of yo in R” and a 
smooth map g: Ug > Vo such that 


Uo x Vo CU, g(xo) = Yo 


and 
F(z, y) =0 = y = g(x) 


forx € Up andy € Vo. 
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A.3 Euclidean Spaces 


This is the arena of Euclidean geometry; i.e. every figure which is studied in 
Euclidean geometry is a subset of Euclidean space. To define it one could 
proceed axiomatically as Euclid did; one would then verify that the ax- 
ioms characterized Euclidean space by constructing “Cartesian Coordinate 
Systems” which identify the n-dimensional Euclidean space E” with the n- 
dimensional numerical space R”. This program was carried out rigorously 
by Hilbert. We shall adopt the mathematically simpler but philosophically 
less satisfying course of taking the characterization as the definition. 

We shall use three closely related spaces: n-dimensional Euclidean affine 
space &”, n-dimensional Euclidean vector space E”, and the space R” of 
all n-tuples of real numbers. The distinction among them is a bit pedantic, 
especially if one views as the purpose of geometry the interpretation of 
calculations on R”. The purpose for distinguishing these three spaces is the 
same as in elementary vector calculus; it aids geometric intuition. Here is 
the precise definition. 


Definition A.3.1. An n-dimensional Euclidean vector space is a real n- 
dimensional vector space E” equipped with a (real valued symmetric positive 
definite) inner product E” x E” > R: (v,w) + (v,w). An n-dimensional 
Euclidean affine space consists of a set £” and an n-dimensional Euclidean 
vector space E” and maps 


BE EE ep) > p= @, 
E” x E” > E” :(p,v) 6 p+u 


satisfying the axioms 
p+0=p, pt+w)=(p+u)+u, g+(-aq=p 


for all p,q € E” and all v,we EK”. The vector p—q € E” is called the 
vector from q to p and the point p+ v is called the translate of p by v. 
It follows easily that each choice of a point o € E£” determines a bijection 
vt>o+t+v from E” onto E”. The inner product on E” equips the space E” 
with a metric via the formula 


lp—ql= V(p—-4a,p—49), p,q € E”. 


The standard Euclidean space of dimension n is E” = E” = R” with the 
usual matrix algebra operations (x +y)' =a' +y’, (x,y) =o, 2'y’. 
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Lemma A.3.2. Any choice of an origin o € E” and an orthonormal ba- 
51S €1,...,€n for E” determines an isometric bijection: 


n 
R" > BE”: (a',...,2") Hot) ae; 
i=1 


(the inverse of which is called a Cartesian coordinate system on E”). 


Lemma A.3.3. If E° > R”: p+ (a1,...,2"), (y',...,y”) are two Carte- 
sian coordinate systems, the change of coordinates map has the form 


where the matrix a = (a?) € R"*” is an orthogonal matrix and v € R”. 


Example A.3.4. Any n-dimensional affine subspace of some numerical 
space R* (with k > n) is an example of a Euclidean space. The corre- 
sponding vector space E” is the unique vector subspace of R* for which: 


BE” =0+E”" 


for o € E£”. This subspace is independent of the choice of o € E”. Note 
that E” contains the “preferred” point 0 while E” has no preferred point. 
Such spaces E” and E” would arise in linear algebra by taking E” to be 
the space of solutions of k — n independent inhomogeneous linear equations 
in k unknowns while E” is the space of solutions of the corresponding ho- 
mogeneous equations. The correspondence between E” and E” illustrates 
the mantra 


The general solution of an inhomogeneous system of linear equa- 
tions is a particular solution plus the general solution of the cor- 
responding homogeneous linear system. 


This discussion shows that a Euclidean space E” is an n-dimensional 
manifold with its Cartesian coordinate systems whose tangent space at each 
point is naturally isomorphic to E”. Thus it is natural to introduce sub- 
manifolds of Euclidean space as submanifolds of E” whose tangent spaces 
are then linear subspaces of the associated vector space E”. Instead we have 
chosen in this book for simplicity of the exposition to describe manifolds as 
subsets of the vector space R” equipped with its standard inner product. 
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isomorphism, 58 
right invariant metric, 243 
symplectic, 53, 387 
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Lie subgroup, 62 

lift of a curve, 142 

local coordinates, 4 

local diffeomorphism, 33 
local isometry, 230 

local trivialization, 76 
locally finite open cover, 101 
locally finite sum, 106 
locally symmetric space, 273. 


manifold, 8, 11, 16, 87 
complex, 13, 
developable, 269 
dimension, 8 
flat, 268 
oriented, 321 
osculating developable, 270 
real analytic, 13 
smooth, 11, 16, 87 
symplectic, 13 
tangent developable, 271 
topological, 8, 88 

maximal existence interval, 41 
for geodesics, 192 

maximal ideal in ¥(M), 70 

maximal smooth atlas, i 

maximal topological atlas, 9 

mean curvature, 272 

metrizable, 101 

minimal geodesic, 197 
existence, 199, 210 

minimal surface, 272 

Mobius strip, 78, 123, 272 
paper model, 271 

Monge coordinates, 28 

Morse index, 328 

motion, 146 
without sliding, 149 
without twisting, 151 
without wobbling, 152 


423 


moving frame, 139, 142 
Myers-Steenrod Theorem, 341 


normal bundle, 76 


1-form with values in TM, 240 
oriented manifold, 321 
orthogonal group, 23 


paraboloid, 271 

elliptic, 6 

hyperbolic, 6 
paracompact, 101 
parallel 

transport, 131 

vector field, 129 
parametrized 

by the arclength, 182 

proportional to arclength, 177 
partition of unity, 103 

existence, 103 

subordinate to a cover, 103 
Pliicker embedding, 96 
Pliicker’s conoid, 271 
Poincaré metric, 290 
principal bundle, 115, 140 
projection 

obvious, 9 

orthogonal, 75 
projective space, 7 

complex, 89, 

real, 89 
proper, 34 
pullback 

of a vector bundle, 76 

of a vector field, 46 
pushforward 

of a vector field, 46 


quadric surface, 5 
quaternions, 53 
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quotient space, 9 
quotient topology, 9 


real projective space, 22, 89 

refinement, 101 

regular value, 21, 32 

relative topology, 9 

relatively closed, 9 

relatively open, 9 

Ricci tensor, 308 
in local coordinates, 308 
positive, 309 

Riemann curvature tensor, 232 
covariant derivative, 275 

in local coordinates, 278 

first Bianchi identity, 239 
in local coordinates, 254 
second Bianchi identity, 276 

Riemann surface, 102 

Riemannian manifold, 166 
Einstein, 317 
flat, 268 
homogeneous, 274 
locally symmetric, 273 
symmetric, 273 

Riemannian metric, 13, 166 
bi-invariant, 242 
conformal class, 316 
distance function, 183, 217 
existence, 168 

Roman surface, 22 

ruled surface, 271 


scalar curvature, 312 
constant, 315 
Yamabe problem, 316 
in local coordinates, 314 
positive, 315 
second Bianchi identity, 276 
second countable, 101 
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second fundamental form, 125 
section of a vector bundle, 74 
sectional curvature, 280 
constant, 281 
nonpositive, 293 
positive, 311 
self-dual 
four-manifold, 324 
Weyl tensor, 324 
Siegel upper half space, 307 
simple Lie algebra, 388 
simply connected, 258 
singular value, 21 


Smirnov Metrization Theorem, 102 


smooth, 10 
atlas, 11 
group action, 68 
groupoid, 340 
manifold, 11, 16, 87 
map, 10, 15, 92 
structure, 12, 88 
vector bundle, 74 
smoothly compatible, 11 
special 
linear group, 23, 57 
unitary group, 57 
sphere, 5 
Sphere Theorem, 311 
spray, 191 
geodesic, 191 
stereographic projection, 3 
structure group, 140 
subbundle, 80, 99 
integrable, 80 
involutive, 80 
submanifold, 34, 95 
totally geodesic, 302 
submersion, 72 
surface, 251 
quadric, 5 
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symmetric space, 273 
is homogeneous, 274 
locally, 273 
symplectic vector space, 387 
compatible inner product, 387 
compatible linear complex 
structure, 387 


tangent bundle, 74, 97 
coordinate chart, 74, 97 
integrable subbundle, 80 
involutive subbundle, 80. 
of the frame bundle, 141 
of the tangent bundle, 191 

tangent space, 24, 93 
horizontal, 142, 173. 
of the frame bundle, 141 
of the tangent bundle, 191 
vertical, 142 

tangent vector, 24, 93 

Theorema Egregium, 245, 253. 

topological atlas, 8 

topological manifold, 8, 88 

topological space 
countable base, 101 
Hausdorff, 91 
locally compact, 102 
metrizable, 101 
normal, 102 
paracompact, 101 
regular, 102 
second countable, 101 
o-compact, 101 

topologically compatibe, 8 

topology 
compact-open, 341 
intrinsic, 88 
quotient, 9 
relative, 9 

torus, 90 
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totally geodesic submanifold, 302 


transition map, 8, 11, 87 

2-form, 321 
anti-self-dual, 322 
self-dual, 322 

Tychonoff’s Lemma, 102 


uniformization theorem, 315 


unitary group, 57 


Urysohn Metrization Theorem, 102 


variation of a curve, 176 
vector bundle, 74 


canonical pro jection, 74 


local trivialization, 76 
section, 74 

vector field, 38, 97 
along a curve, 127 
along a map, 170 
basic, 145, 173. 
flow, 41 
integral curve, 39 
Killing, 345 
Lie bracket, 49 
normal, 135 
parallel, 129 


with compact support, 43. 


Veronese embedding, 96 
vertical tangent space, 142 


wallpaper group, 358 
Weyl tensor, 318 


in local coordinates, 320 


self-dual, 324 


Whitney Embedding Theorem, 112 


Yamabe problem, 316 


